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(57) Abstract 

Four novel members of the EPH sub- 
family of receptor protein tyrosine kinases are 
disclosed. Nucleic acid sequences encoding 
receptor proteins, recombinant plasmids and 
host cells for expression, and methods of 
producing and using such receptors are also 
disclosed. 
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HEK5, HEK7, HEK8, HEK11, new EPH-Hke receptor protein tyrosine kinases 

i 

I 

Field of thg Invention 
5 The invention relates , generally to receptor 

protein tyrosine kinases (PTKs) and particularly to 
novel Eph-like receptor PTKs, to fragments and analogs 
thereof, and to nucleic acids encoding same. The 
present invention also relates to methods of producing 
10 and using such receptors. 

t 

Packground of the TirrentAon 

Receptor PTKs are a structurally related 

15 family of proteins that mediate the response of cells to 
extracellular signals (Ullrich et al. Cell £1, 203-212 
(1990)). These receptors are characterized by three 
major functional domains: an intracellular region 
containing the sequences responsible for catalytic 

20 activity, a single hydrophobic membrane-spanning domain, 
and a glycosylated extracellular region whose structure 
determines ligand binding specificity. Signal 
transduction is initiated by the binding of growth or 
differentiation factors to the extracellular domain of 

25 their cognate receptors. Ligand binding facilitates 

dimerization of the receptor which can induce receptor 
autophosphorylation. Both soluble and membrane- 
associated protein ligands have been shown to function 
in this manner. This process is the initial step in a 

30 cascade of interactions involving the phosphorylation of 
a variety of cytoplasmic substrates and culminating in a 
biological response by the cell. The best characterized 
response to tyrosine kinase receptor activation is cell 
growth. However, analysis of the role of some growth 

35 factors in vivo suggests that differentiation or cell 
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survival might also be mediated by tyrosine kinase 
receptor/ ligand interactions. 

Receptor PTKs have been grouped into fairly 
5 well-defined families on the basis of both sequence 
•homology and shared structural motifs. The amino acid 
sequence of the portion of the intracellular domain 
responsible for the catalytic activity is well conserved 
among all tyrosine kinases and even more closely matched 

10 within a receptor sub-family. Comparisons of this 

portion of the amino acid sequence have been used to 
construct phylogenetic trees depicting the relatedness 
of family members to each other and to the tyrosine i 
kinases as a whole (Hanks and Quinn, Methods Enzymol. 

15 200 . 38-62 (1991)). This sequence conservation has also 
been exploited in order to isolate new tyrosine kinases 
using the polymerase chain reaction (PCR) (Wilks, Proc. 
Natl. Acad. Sci. USA 1603-1607 (1989)). 

Oligonucleotides based on the highly conserved catalytic 

20 domain of PTKs can be used as PCR primers to amplify ( 
related sequences present in the template. These 
fragments can then be used as probes for isolation of 
the corresponding full-length receptor clones from cDNA 
libraries. Anti-phosphotyrosine antibodies have also 

25 been used to identify PTK cDNA clones in phage 

expression libraries (Lindberg and Pasquale, Methods 
Enzymol. 200 r 557-564 (1991)). These strategies have 
been used by a number of investigators to identify an 
ever- increasing number of protein tyrosine kinase 

30 receptors. 

There are now 51 distinct PTK receptor genes 
that have been published and divided into 14 
sub-families One such sub-family is the EPH-like 
35 receptors. The prototype member, EPH, was isolated by 
Hirai et.al. (Science 238 . 1717-1720 (1987)) using low 
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stringency hybridization to a probe derived from the 
viral oncogene v-fps. EPH-like receptors have been 
implicated in cell growth based in part on studies which 
show that, overexpression o^ the gene in NIH3T3 cells 
5 causes focus formation in soft agar and tumors in nude 
mice (Maru et al. Oncogene £, 199-204 (1990)). Other 
members of the EPH sub-family which have been identified 
include the following:' 1 
_ ECK- (Lindberg et al. Mol. Cell. Biol. 1£, , 

10 6316-6324 (1990)) 

Elk (Lhot&k et al. Mol. Cell. Biol. 11, 2496- 

2502 (1991)) " . v , 

Ceks 4,5/6,7,8,9, and 10 (Pasquale, Cell 

Regulation 2, 523-534 (1991); Sajjadi et al. The New 
15 Biologist 3, 769-778 (1991); Sajjadi and Pasquale 

Oncogene Sl, 1807-1813 (1993) ) 

HEK2 (Bohme et al. Oncogene £, 2857-2862 



(1993) ) 
20 (1991)) 



Eek, Erk (Chan and Watt, Oncogene 1057-1061 



Ehkl, Ehk2 (Maisonpierre et al. Oncogene £, 
3277-3288 (1993)) 

Homologs for some of these receptors have been 
25 identified in other species (Wicks et al. Proc. Natl. 
Acad. Sci. USA £2/ 1611-1615 (1992)); Gilardi- 
Hebenstreit et al. Oncogene 2, 2499-2506 (1992)). The 
- expression patterns and developmental profiles of 
several family members suggest that these receptors and 
30 their ligands are important for the proliferation, 

differentiation and maintenance of a variety of tissues 
(Nieto et al. Development H£, 1137-1150 (1992)). 
Structurally, EPH sub-family members are characterized 
by an Ig-like loop, a cysteine rich region, and two 
35 fibronectin-type repeats in their extracellular domains. 
The amino acid sequences of the catalytic domains are 
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more closely related to the SRC sub-family of 
cytoplasmic PTKs than to any of the receptor PTKs. 
Among the catalytic domains of .receptor PTKs, the EPH 
sub-family is most similar in amino acid sequence to the 
5 epidermal .growth factor receptor sub- family. 

It is an object of the invention to identify 
novel receptors belonging to the EPH sub- family. A 
directed PCR approach has been used to identify five 
10 human EPH-like receptors from a human fetal brain cDNA 
library. These receptors are designated HEK4, HEK5, 
HEK7, HEK8, and HEK11. The relationship of these 
receptors to previously identified EPH-like receptors is 
as follows: 

15 HEK4 is 'the human homolog of Cek4 (chicken) 

and Mek4 (mouse) and is identical to HEK (Boyd et al. J. 
Biol. Chem. 2£2, 3262-3267 (1992); Wicks et al., 1992) 
which was previously isolated from a human lymphoid 
tumor cell line. , 

20 HEK5 is the human homolog of Cek5,' a full- 

length eph-like receptor clone from chicken. A portion 
of the HEK5 sequence was previously disclosed as ERK, a 
human clone encoding about sixty amino acids (Chan and 
Watt, 1991) 

25 HEK7 is the human homolog of Cek7 isolated 

from chicken. 

HEK8 is the human homolog of Cek8 a full- 
length clone from chicken and Sek, a full-length clone 
from mouse. (Nieto et al., 1992; Sajjadi et al., 1991) 

30 HEK11 does not have a known non-human homolog. 

With the addition of the new members HEK5, HEK7, HEK8 
and HEK11 and the report of a PCR fragment encoding an 
eph-like receptor (Lai & Lemke Neuron £, 691-704 
(1991)), a total of twelve distinct sequences that 

35 represent EPH-like receptors have been published, making 
it the largest known sub- family of PTKs. 
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It is a further object of the invention to 
generate soluble EPH-like receptors and antibodies to 
EPH-like receptors- Soluble receptors and antibodies 
are useful for modulating EPH-like receptor activation. 

5 

Summary of the Invention 

The present invention provides novel EPH-like 
receptor protein tyrosine kinases. More particuliarly, 
the invention provides isolated IfmcTet c~acids"~encdding 

10 four novel members of the sub-family of EPH-like 

receptor PTKs which are referred to collectively as HEKs 
(human-eph like kinases) . Also encompassed are nucleic 
acids which hybridize under stringent conditions to 
EPH-like receptor 'nucleic acids. Expression vectors and 

15 host cells for the production of receptor polypeptides 
and methods of producing receptors are also provided. 

Isolated polypeptides having amino acid 
sequences of EPH-like receptors are also provided, as 
are fragments and analogs thereof. Antibodies 

20 specifically binding the polypeptides of the invention 
are included. Also comprehended by the invention are 
methods of modulating the endogenous activity of an 
EPH-like receptor and methods for identifying receptor 
ligands. 

25 

Descript ion of thg Figures 

Figure 1 shows the nucleotide and predicted amino acid 
sequence of the HEKS receptor. 

30 Figure 2 shows the nucleotide and predicted amino acid 

sequence of the HEK7 receptor.- 

Figure 3 shows the nucleotide and predicted amino acid 

sequence of the HEK8 receptor. 
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Figure 4 shows the nucleotide ' and predicted amino acid 
sequence of the HEK11 receptor. 

Figure 5 shows the comparison of the amino acid 
5 sequences 'of the human EPH receptor sub-family. The 
multiple sequence alignment was -done using the LineUp 
program included in the Genetics Computer Group sequence 
analysis software package (Genetics Computer Group, 
(1991) , Program Manual for the GCG Package, Version 7, 

10 April 19191, Madison, Wisconsin, USA 53711) . Dots 
indicate spaces introduced in order to optimize 
alignment. The predicted transmembrane domains and 
signal* sequences of each - receptor are indicated by ( 
underlining and italics, respectively. Cysteine 

15 residues conserved throughout the sub- family are 

indicated with asterisks. Arrows indicate the tyrosine 
kinase catalytic domain. Amino acid sequences of EPH, 
ECK and HEK2 were taken from the appropriate literature 
references. , ( i 

20 

! 

Figure 6 shows the molecular phylogeny of the EPH sub- 
family of receptor protein tyrosine kinases. Catalytic 
domain sequences were analyzed as described by Hanks and 
Quinn, 1991. The scale bar represents an arbitrary 
25 evolutionary difference unit. The EPH branch, which has 
been shown with a discontinuity for the sake of 
compactness, is 23.5 units in length. 

Figures 7-11 show Northern blot analyses of the tissue 
30 distribution of the HEK receptors. Receptor cDNA 

probes, labeled with 32 P, were hybridized to either 2 |ig 
of poly A + RNA from human tissues (panel A, Clontech) or 
10 \ig of total RNA from rat tissues (panel B) . Sizes of 
the transcripts were determined by comparison with RNA 
35 molecular weight markers (Bethesda Research Labs, 
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Gaithersburg, MD) . Figure 7, HEK4; Figure 8, HEK5; 
Figure 9, HEK7; Figure 10; HEK8; Figure 11; HEK 11. 

potaii^ri Descr iption of the invention 

5 The present invention relates to novel 

EPH-like receptor protein tyrosine kinases. More 
particularly, the invention relates to isolated nucleic 
acids encoding four novel members of the sub-family of 
EPH-like receptor PTKs. These four members are 

10 designated herein as HEK (human eph-like kinases) . 

Nucleic acids encoding HEK receptors were identified in 
a hu man fetal_brain cDNA_library using oligonucleotide 
probes to conserved regions of receptor PTKs and EPH- 
like receptor PTKsi. The predicted amino acid sequences 

15 of three HEK receptors had extensive homology in the 
catalytic domain to previously identified EPH-like 
receptors Cek5, Cek7 and Cek8 isolated from chicken and, 
accordingly, are designated HEK5, HEK7 and HEK8. The 
predicted amino acid sequence of the fourth , HEK receptor 

20 revealed that it was not a homolog of any previously 
identified EPH-like receptor. It is designated HEK11. 
It is understood that the term "HEKs" comprises HEK5, 
HEK7, HEK8 and HEK11 as well as analogs, variants, and 
mutants thereof which fall within the scope of the 

25 invention. 

The invention encompasses isolated nucleic 
acids selected from the group consisting of: 

(a) the nucleic acids set forth in any of SEQ 
30 ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, or SEQ ID NO: 

16 and their complementary strands; 

(b) a nucleic acid hybridizing to the coding 
regions of the nucleic acids in any of SEQ ID NO: 10, 
SEQ ID NO: 12, SEQ ID NO: 14, or SEQ ID NO: 16 under 

35 stringent conditions; and 
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(c) a nucleic acid of (b) which, but for the 
degeneracy of the genetic code, would hybridize to the 
coding regions of the nucleic acids in any of SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NQ: 14, or SEQ ID NO: 16. 
The nuclei'c acids of the invention preferably hybridize 
to HEK5, HEK7, HEK8, or HEK11 coding regions under 
conditions allowing up to about 5% nucleotide mismatch 
based upon observed nucleic ,acid identities among known 
kyi?. 311 . 9 r . ( nonhuman EPH-like receptors . An example of 
such a condition is hybridization at 60* in 1M Na+ 
followed by washing at 60* in 0.2XSSC. Other 
hybridization conditions 'may be ascertained by one 
skilled in the art which allow base pairing with similar 
levels of mismatch. 
15 ! In a preferred embodiment, the isolated 

nucleic acids encode polypeptides having the amino acid 
sequences of HEK5, HEK7, HEK8 or HEK11. A nucleic acid 
includes cDNA, genomic DNA, synthetic DNA or RNA. 
Nucleic acids of this invention, may encode full-length 
receptor polypeptides having an extracellular 
ligand-binding domain, a transmembrane domain, and a 
cytoplasmic domain, or may encode fragments such as 
extracellular domains which are produced in a soluble, 
secreted form. Nucleic acid constructs which produce 
25 soluble HEK receptors are described in Example 3. 

Polypeptides and fragments encoded by the nucleic acids 
have at least one of the biological activities of an 
EPH-like receptor protein tyrosine kinase, such as the 
ability to bind ligand. 



20 



30 

The invention also encompasses nucleic acids 
encoding chimeric proteins wherein said proteins 
comprise part of the amino acid sequence of a HEK 
receptor linked to an amino acid sequence from a 
35 heterologous protein. One example of such a chimeric 
protein is an extracellular domain of a HEK receptor 
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fused to a heterologous receptor cytoplasmic domain. 
Example 5 describes the construction and expression of a 
chimeric receptor comprising the HEK8 extracellular 
domain with the trkB cytoplasmic domain and a second 
chimeric receptor comprising the HEK11 extracellular 
domain with the trkB cytoplasmic domain. HEK receptors 
may also be ftised to other functional protein domains, 
such as an Ig ciomain which acts as an antibody 
recognition site. 



10 



The nucleic acids of the present invention may 
be linked to heterologous nucleic acids^ which provide 
expression of receptor PTKs. Such heterologous nucleic 
acids include biologically functional plasmids or viral 

15 vectors which provide genetic elements for 

transcription, translation, amplification, secretion, 
etc. One example of an expression vector suitable for 
producing EPH-like receptors of the present invention is 
pDSRa which is described in Example 3. It is understood 

20 that other vectors are also suitable for expression of 
EPH-like receptors in mammalian, yeast, insect or 
bacterial cells. In addition, in vivo expression of 
nucleic acids encoding EPH-like receptor PTKs is also 
encompassed. For example, tissue-specific expression of 

25 EPH-like receptors in transgenic animals may be readily 
effected using vectors which are functional in selected 
tissues. 

Host cells for the expression of EPH-like 
30 receptor PTKs will preferably be established mammalian 

cell lines, such as Chinese Hamster Ovary (CHO) cells or 
NIH 3T3 cells, although other. cell lines suitable for 
expression of mammalian genes are readily available and 
may also be used. Such host cells are transformed or 
35 transfected with nucleic acid constructs suitable for 
expression of an EPH-like receptor. Transformed or 
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transfected host cells may be used to produce suitable 
quantities of receptor for diagnostic or therapeutic 
uses and to effect targeted expression of EPH-like 
receptors in selected adult tissues, such as brain, 
kidney, and liver, or in embryonic or rapidly dividing 
tissues. 

. i 

The present invention provides purified and 
isolated polypeptides having at least one of the 
biological properties of an EPH-like receptor (e.g. 
ligand binding, signal transduction) . The isolated 
polypeptides will preferably have an amino acid sequence 
as shown in any of SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14 or . SEQ ID NO: 16. Polypeptides of this invention 
15 may be full-length 'polypeptides having an extracellular 
domain, a transmembrane domain, and a cytoplasmic 
domain, or may be fragments thereof, e.g., those having 
only an extracellular domain or a portion thereof, it 
will be understood , that the receptor polypeptides may 
also be analogs or naturally-occurring variants of the 
amino acid sequences shown in SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 14 or SEQ ID NO: 16. Such analogs are 
generated by amino acid substitutions, deletions and/or 
insertions using methods available in the art. 

Polypeptides of the invention are preferably 
the product of expression of an exogenous DNA sequences, 
i.e., EPH-like receptors are preferably produced by 
recombinant means. Methods of producing EPH-like 
receptors comprising culturing host cells which have 
been transformed or transfected with vectors expressing 
an EPH-like receptor are also encompassed. EPH-like 
receptors, particularly fragments, may also be produced 
by chemical synthesis. The polypeptides so produced may 
be glycosylated or nonglycosylated depending upon the 
35 host cell employed, or may have a methionine residue at 
the amino terminal end. The polypeptides so produced 
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are identified and recovered from cell cultures 
employing methods which are conventional in the art. 

EPH-like receptors of the present invention 
are used for the production of antibodies to the 
5 receptors.' Antibodies to HEK receptors have been 

described in Example 4 . Antibodies which recognize the 
polypeptides of the invention may be polyclonal or 
monoclonal and may be binding fragments or chimeric 

—anti-bodies-. Such -antibodies-are-use^u-1-J.n the detection 

10 of EPH-like receptors in diagnostic assays in the 
purification of receptor, and in the modulation of 
EPH-like receptor activation. 

As described in co-pending and co-owned U.S. 
15 Serial No. 08/145,616, the only known ligand for an 
EPH-like receptor is a protein which binds to and 
induces phosphorylation of the eck receptor. The ECK 
receptor ligand was previously identified as B61. 
(Holzman et al. Mol. Cell. Biol. 1£, 5830-5838 (1990)). 
20 The availability of ECK receptor was important for the 
identification of a ligand since B61, although known, 
had not been previously implicated as an ECK receptor 
ligand. Therefore, EPH-like receptors having ligand 
binding domains are useful for the identification and 
25 purification of ligands. Polypeptides of the present 

invention may be used to identify and purify ligands for 
HEK5, HEK7, HEK 8 and HEK11 receptors. Binding assays 
for the detection of potential ligands may be carried 
out in solution or by receptor immobilization on a solid 
30 support using methods such as those described in 

co-pending and co-owned U.S. Serial No. 08/145,616. 
Such assays may employ an isolated ligand binding domain 
of a HEK receptor. Alternatively, a HEK ligand binding 
domain fused to an Ig domain may be used to detect the 
35 presence of HEK ligand on cell surfaces. 
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Soluble EPH-like receptors may be used to 
modulate (i.e., increase or decrease) the activation of 
the cell-associated receptors, typically by competing 
with the receptor for unbound ligand. Modulation of 
EPH-like receptor activation may, in turn alter the 
proliferation and/or differentiation of receptor-bearing 
cells. For example, based upon the observed tissue 
distribution of i the receptors of this invention (see 
Table 5) , soluble HEK7 receptor is likely to primarily 
affect proliferation and/or differentiation of brain 
cells, while soluble HEK5 receptor may affect primarily 
brain and pancreatic cells, although effects of HEK5 
receptor on other tissues may not be excluded. i 

Antibodies to EPH-like receptors are useful 
reagents for the detection of receptors in different 
cell types using immunoassays conventional to the art. 
Antibodies are also useful therapeutic agents for 
modulating receptor activation. Antibodies may bind to 
the receptor so as to directly or indirectly block 
ligand binding and thereby act as an antagonist of 
receptor activation. Alternatively, antibodies may act 
as an agonist by binding to receptor so as to faciliate 
ligand binding and bring about receptor activation at 
lower ligand concentrations. in addition, antibodies of 
25 the present invention may themselves act as a ligands by 
inducing receptor activation, it is also contemplated 
that antibodies to EPH-like receptors are useful for 
selection of cell populations enriched for EPH-like 
receptor bearing cells. Such populations may be useful 
in cellular therapy regimens where it is necessary to 
treat patients which are depleted for certain cell 
types. 



15 



20 



30 



The isolated nucleic acids of the present 
inventions may be used in hybridization assays for the 
35 detection and quantitation of DNA and/or RNA coding for 
HEK5, HEK7, HEK8, HEK11 and related receptors. Such 
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i 

assays are important in determining the potential of 
various cell types to express these receptors and in 
determining actual expression levels of HEK receptors 1 . 
In addition, the nucleic acids are useful for detecting 
5 abnormalities in HEK receptor genes, such as 

translocations, rearrangements, duplications, etc. 

! 

Therapeutic regimens involving EPH-like 
receptors, will typically involve use of the soluble form 

10 of the receptor contained in a pharmaceutical 

composition. Such pharmaceutical compositions may 
contain pharmaceutical ly acceptable carrier, diluents, 
fillers, salts, buffers, stabilizers and/or other 
materials well known in , the art.' Further examples of 

15 such constituents are described in Remington's 

Pharmaceutical Sciences 18th ed., A.R. Gennaro, ed. 
(1990) . Administration of soluble EPH-like receptor 
compositions may be by a variety of routes depending 
upon the condition being treated, although typically 

20 administration will occur by intravenous or subcutaneous 
methods. Pharmaceutical compositions containing 
antibodies to EPH-like receptors will preferably include 
mouse-human chimeric antibodies or CDR-grafted 
antibodies in order to minimize the potential for an 

25 immune response by the patient to antibodies raised in 
mice. Other components of anti-EPH antibody 
compositions will be similar to those described for 
.soluble receptor. 

The amount of soluble Eph-like receptors or 

30 anti-Eph antibody in a pharmaceutical composition will 
depend upon the nature and severity of the condition 
being treated. Said amount may be determined for a 
given patient by one skilled in the art. It is 
contemplated that the pharmaceutical compositions of the 
35 present invention will contain about 0.01 \ig to about 
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10 



100 mg of soluble receptor or ant i-Eph .antibody per kg 
body weight. 

A method for modulating the activation of an 
EPH-like receptor PTK is also provided by the invention. 
In practicing this method, a therapeutically effective 
amount of a soluble EPH-like receptor or an anti-EPH 
antibody is administered. The term "therapeutically 
effective amount" is that amount which effects an 
increase or decrease in the activation of an EPH-like 
receptor and will range from about 0.01 jig to about 100 
mg of soluble receptor or anti-EPH antibody 'per kg body 
weight. In general, therapy will be appropriate for a 
patient having a condition treatable by soluble receptor 
15 or anti-EPH antibody and it is contemplated that such a 
condition will in part be related to the state of 
proliferation and/or differentiation of receptor-bearing 
cells. Based upon the tissue distribution of HEK 
receptors shown in Table 4, treatment with the 
pharmaceutical compositions of the invention may be 
particularly indicated for disorders involving brain, 
heart, muscle, lung, or pancreas. However, some HEK 
receptors are displayed on a wide variety of tissues, so 
it is understood that the effects of modulating receptor 
25 activation may not be limited to those tissues described 
herein . 

The following examples are offered to more 
fully illustrate the invention, but are not to be 
construed as limiting the scope thereof. Recombinant 
DMA methods used in the following examples are generally 
as described in Sambrook et al. Molecule n^ -i^. A 
laboratory Manual Cold Spring Harbor Laboratory Press, 
2nd ed. (1989) 
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EXAMPLE 1 

winning a nd Sequencing of HEK Receptor CPNA 

We have isolated clones for five members- of 
5 the EPH siab- family of receptor PTKs from a human fetal 
brain cDNA library. Oligonucleotides were designed 
based on conserved amino acid sequences within the 
kinase domain, ' Primer I wa£ based on the amino acid 
sequence , Trp-Thr-Ala-Pro-Glu-Ala-Ile (SEQ ID NO: 1) , 

10 which is well-conserved among PTKs of many families. 

Primer II was based on the sequence Val-Cys-Lys-Val-Ser- 
Asp-Phe-Gly (SEQ ID NO: 2), which is invariant among EPH 
sub-family members but, except for the sequence Asp-Phe- 
Gly, is rarely found in* other PTKs* Fully degenerate 

15 oligonucleotides corresponding to reverse translations 

of these protein sequences were synthesized and utilized 
as primers in a polymerase chain reaction (PCR) with 
disrupted phage from a human fetal brain cDNA library as 
the template. The products of this PCR reaction were 

20 cloned into the plasmid vector pUC19 and the nucleotide 
sequence of the inserts was determined. Of the 35 PCR 
inserts sequenced, 27 were recognizable as portions of 
PTK genes. Their correspondence to previously published 
sequences is summarized in Table 1. 
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Six PCR inserts predict amino acid sequences 
which are identical to a portion of SRC, although they 
comprise two distinct nucleotide sequences. One insert 
appears to code for the human platelet derived growth 
5 factor (PDGF) -p receptor. The remaining 18 PCR inserts 
consist of 6 distinct nucleotide sequences, all of which 
appear to be fragments of EPH 'sub-family members. One 
of the sequence predicts an amino acid sequence 
identical to the corresponding region of rat Elk (Lhotak 

10 et al., 1991)) and is likely to represent its human 

homolog. Two inserts predict amino acid sequences which 
match the translation of the PCR fragment tyro-4 (Lai 
and Lemke, 1991) ) but are clearly distinct at the 
nucleotide level while two others correspond to tyro-1 

15 and tyro-5. The sixth PCR insert has a previously 
unreported HPtf-related sequence. Since five of the 
clones contained portions of potential EPH sub- family 
members for which full-length sequences had not been 
reported, each was 1 radiolabeled and used as t a probe to 

20 screen a human fetal brain cDNA library. Several clones 
corresponding to each of the five probes were isolated. 
For each of the five receptors, the nucleotide sequence 
of the clone containing the largest portion of the 
predicted coding region was determined. 

25 

A single cDNA clone containing the complete 
coding region was isolated only for HEK4. The portions 
of HEK5, BEK1, HEK10 and HEK11 coding for the amino 
terminus of these receptors were not found in any of the 

30 clones. In order to obtain the complete coding 

sequence, the Rapid Amplification of cDNA Ends (RACE) 
technique was employed. In some cases, more than one 
round of RACE was necessary to obtain the missing 
portion of the coding region. Using this strategy, 

35 complete coding sequences were obtained for all clones 
except HEK7 which lacked the complete leader sequence. 
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The DNA sequences of HEK5, HEK7, HEK8 and HEK11 are 
shown in Figures 1-4, respectively, and in SEQ ID NO: 10 
(HEK5), SEQ ID NO: 12 (HEK7) , SEQ ID NO: 14 (HEK 8) and 
SEQ ID NO: 16 (HEK11) . The, amino acid sequences are 
Shown 'in SEQ ID NO: 11 (HEK5) , SEQ lb NO: 13 (HEK7) , SEQ 
ID NO: 15 (HEK8) and SEQ ID NO: 17 (HEK 11). 



EXAMPLE 2 



10 Analyst nf HEK Recpnl-or s^ o ^ o 

HEK5„ HEK7, HEK8 and HEK11 represent novel 
human EPH sub-family members, although homologs for all 
except HEK11 have been isolated from otner species. We 
refer to human EPH receptor sub r family members as HEKs 
(Human .EPH-like kinases) following the nomenclature of 
Wicks et al., 1992) . We have chosen names and numbers 
for these receptors to correspond with previously 
discovered members of the family in chicken (Ceks) and 
in mouse (Mek) (Sajjadi et al. ,1991; Sajjadi and 
Pasquale, 1993; Pasquale, 1991) . Extending the 
convention of designating the species of origin by the' 
first letter, we refer to the rat homologs of the HEK 
receptors as Reks ( £ at Epfl -like kinases) . 



HEK4 is the human homolog of the chicken 
receptor Cek4 (91% amino acid identity in the catalytic 
domain) and the mouse receptor Mek4 (96% amino acid 
identity in the catalytic domain) . The amino acid 
sequence of HEKS is very closely related (96% amino acid 
identity in the catalytic domain) to the chicken 
receptor Cek5 (Pasquale et al. J. Neuroscience 12, 3956- 
3967 (1992); Pasquale, 1991). HEK7 is probably the 
human homolog of the recently reported Cek7 (Sajjadi and 
Pasquale, 1993). HEK8 is likewise very closely related 
35 to Sek (Gilardi-Hebenstreit et al., 1992)) and Cek8 (95% 
amino acid identity in the catalytic domain) (Sajjadi 
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, > 

and Pasguale, 1993) ) . The human homologs for Cek6 and 
Cek9 have yet to be reported, while the human homolog of 
CeklO has just . recently been published. One of our 
human receptors has no close relatives in other species 
and apparently represents a novel member of the EPH sub- 
family. We have designated this receptor HEK11, 
assuming that human homologs for Cek 9 and 10 will be 
named HEK9 and 'hekIO, respectively. A summary of known 
— EPH-sub- f amily -member s-is shown in Table 2. 



10 



TABLE 2 . ^ 

EPH receptor sub- family members 

15 Human 1 Nnn^human homoloas 

gpu None identified 

ECK None identified 

None identified* 1 Eek 

HEK4* Cek4 / Mek4 

20 HEK5 Cek5, Nuk, ERK 

None identified* Cek6, Elk 

HEK7 Cek7, Ehkl 

HEK8 Cek8, Sek 

None identified* Cek9 



25 HEK2 



CeklO 



30 



HEK11 None identified 

None identified Ehk2 

*published by Wicks et.al., 1992 as HEK 
*Using the present nomenclature, the predicted human 
homolog of Eek is designated HEK3 . For Cek6, the 
predicted human homolog is designated HEK6; For Cek9, 
the predicted human homolog is designated HEK9. 
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The predicted amino acid sequences of the four 
novel receptor clones and the previously known EPH 
sub- family members ECK (SEQ ID NO: 18) , EPH (SEQ ID NO: 
19), HEK2 . (SEQ ID NO: 20) and HEK4 (SEQ ID NO: 21) were 
aligned as shown in Fig. 5. The four clones are closely 
related to each other and to the known EPH sub-family 
members. The extracellular domain sequences of all four 
novel receptors contain the Ig-loop, fibronectin-type 
III repeats, and cysteine-rich region characteristic of 
EPH sub-family members. The positions of the 20 
cysteine residues are conserved among all sub-family 
members. Also completely conserved is the portion of 
the catalytic domain used as the basis for the EPH sub- 
family specific pr'imer (Val-Cys-Lys-Val-Ser-Asp-Phe'-Gly, 
SEQ ID NO: 2, amino acids 757-764 in Fig. 5) . Table 3 
summarizes the percentage of sequence identity between 
pairs of human EPH sub- family members. The lower 
portion of the table shows percent amino acid identity 
in the catalytic domain while the upper half shows 
percent amino acid identity in the extracellular region. 
The amino acid sequences of the EPH-like receptors are 
extremely well-conserved (60-89% amino acid identity) in 
the catalytic region but not as highly conserved in the 
extracellular region (38-65% amino acid identity) , as 
would be expected for members of the same receptor sub- 
family. 
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TABLE 3 

Eph family amino acid sequence comparison 

extracellular domains 





EPH 


ECK 


HEK4 


HEK5 


HEK7 


HEK8 


HEK2 


HEK11 


EPH 


'* 


47 


42 


38 


40 


43 


40 


42 


ECK 


62 


* 


47. 


41 1 


45 


46 


41 


46 


HEK4 


62 


76 


* 


53 


65 


61 


51 


59 


HEK5 


60 


74 " 


~81 




52 


53 


63 


51 


HEK7 


61 


76 


89 


83 




62 


48 


61 


HEK8 


62 


76 


86 


85 


88 


* 1 


52 


57 


HEK2 


61 


74 


81 


89 


82 


83 


* 


48 


HEK11 


60 


74 


83 


83 


85 


85 


80 


* 



Catalytic domains 



Numbers shown are precent identity 

i 

10 Pairwise comparisons of amino acid sequences 

can be used to construct phylogenetic trees depicting 
the evolutionary relatedness of a family of molecules. 
Figure 6 is such a tree, which summarizes the 
relationships among the EPH sub-family members. Only 

15 one family member is shown from each group of cross- 
species homologs and the human representative was used 
whenever possible (refer to Table 2 for a summary of 
cross-species homologs) . The branch lengths represent 
the degree of divergence between members. It has been 

20 shown previously that the EPH sub-family lies on a 

branch evolutionarily closer to the cytoplasmic PTKs 
than to other receptor PTKs (Lindberg and Hunter, 1993) . 
Interestingly, the further one moves up the tree, the 
more closely related the receptors become and expression 

25 becomes more localized to the brain. 
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EXAMPLE 3 

Construction and Emrps^ o n of trey Rgrv^ni-n-r 
Extracellular npflijji.infi , 

1 Soluble extracellular ( forms of HEK receptor 
proteins were constructed by deletion of DNA sequences 
encoding transmembrane and cytoplasmic domains of the 
receptors and introduction of a translation stop codon 
at the 3- end of the extracellular domain. A construct 
of the HEK5 extracellular domain had a stop codon 
introduced after lysine at position 524 as shown in 
Figure 1; the HEK7 extracellular domain was constructed 
with a stop codon after glutamine at position 547 as i 
shown in Figure 2; the HEK 8 extracellular domain was 
constructed with a stop codon after threonine at 
position 547 as shown in Figure 3. 

HEK extracellular domain was amplified from a 
human fetal brain cDNA library by PCR using primers 5* 
and 3' to the extracellular domain coding region. 
20 For HEK5, the primers 



5« CTGCTCGCCGCCGTGGAAGAAACG (SEQ ID NO: 22) and; 

5' GCGTCTAGATTATCACTTCTCCTGGATGCTTGTCTGGTA (SEq'iD 
NO: 23) 

were used to amplify the extracellular domain and to 
provide a restriction site for cloning into plasmid 
pDSRcc. In addition, the following primers were used to 
provide a translational start site, the elk receptor 
signal peptide for expression; and a restriction site 
for cloning into pDSRoc: 



f 
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5 ' GCGGTCGACGCCGCCGCCATGGCCCTGGATTGCCTGCTGCTGTTCCTCCTG 

(SEQ ID NO: 24) and; 
5 » CGTTTCTTCCACGGCGGCGAGCAGAGATGCCAGGAGGAACAGCAGCAGGCA 

5 ATC (SEQ ID NO: 25) 

i 

The resulting construct resulted in fusion of 
DNA encoding the elk signal sequence Met-Al a-Leu-Asp- 
Cys-Leu-Leu-Leu-Phe-Leu-Leu-Ala-Ser (SEQ ID NO: 26) to 
10 the first codon of the HEK5 receptor • 

The resulting HEK5 extracellular domain was 
cloned into pDSRa after digestion with Sail and Xbal and 
transfected into CHO cells for expression, 

HEK8 extracellular domain was amplified .from a 
15 human fetal brain cDNA library by PGR using primers 5' 
and 3' to the extracellular domain coding region. For 
HEK8, the primers 

5' GAATTCGTCGACCCGGCGAACCATGGCTGGGAT and 
20 5' GAATTCTCTAGATTATCATGTGGAGTTAGCCCCATCTC 

were used to amplify the extracellular domain and to 
provide restriction sites for cloning into plasmid 
pDSRa. 

25 The resulting HEK8 extracellular domain was 

cloned into pDSRa after digestion with Sail and Xbal and 

transferred CHO cells for expression. 

HEK7 extracellular domain was amplified from a 
human fetal brain cDNA library by PCR using primers 5 1 
30 and 3' to the extracellular domain coding region. For 
HEK7, the primers . . 

5 • TTCGCCCTATTTTCGTGTCTCTTCGGGATTTGCGACGCTCTCCGGACCCTCCTG 
GCCAGC and 

35 5 • GAATTCTCTAGATTATCACTGGCTTTGATCGCTGGAT 
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were used to amplify the extracellular domain . In 
addition, the following primers were used to provide a 
translational start site, the HEK8 receptor signal 
peptide sequence, and restriction site for cloning into 
plasmid pDSRa. 



GAATTCGTCGACCCGGCGAACCATGGCTGGGATTTTCTATTTCGCCCTATTTTCGT 
GTCT 



10 5 1 GAATTCTCTAGATTATCACTGGCTf TGATCGCTGGAT 

The resulting' construct resulted in fusion of 
DNA ihcoding~HEK8 signal sequence Met-Ala-Gly-Ile-Phe-i 

Tyr-Phe-Ala-Leu-Phe-Ser-Cys-Le^-Phe-Gly-Ile-Cys-Asp to 
15 the first codon of the HEK7 receptor. 

The resulting HEK7 extracellular domain was 
cloned into pDSRct after digestion with Sail and Xbal and 
transfected into CHO cells for expression. 



EXAMPLE 4 



Antibodies to HE K Receptor* 

Antibodies to HEK receptor proteins were 
generated which recognize the extracellular domain by 
25 using bacterial fusion proteins as the antigen. 

Antibodies were also generated which recognize the 
cytoplasmic domain by using synthetic peptides as the 
antigen . 

The methodology employed has been previously 
described (Harlow and Lane, In Antibnd ^: a T.^hor^nry 
Manual r 1966) . For the extracellular domain antibodies, 
cDNAs were inserted into the pATH vector (see Table 4 
for the regions of each receptor encoded by this 
construct) . These constructs were expressed in bacteria 
35 and the resultant TrpE-fusion proteins were purified by 
SDS-polyacrylamide gel electrophoresis. For the 
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cytoplasmic domain anti-peptide antibodies, peptides 
were synthesized (see Table 4 for the sequences) and 
covalently coupled to keyhole limpet hemocyanin . The 
fusion proteins and coupled 1 peptides, were used as * 
antigens in rabbits and antisera' were generated and 
characterized as described (Hariow and Lane, 1988) . 
Anti-peptide Antibodies were affinity purified by using 
a SulfoLink kit' (Pierce, Rockford IL) • 



TABLE 4 



15 



20 



i 

Receptor 

HEK4 
HEK5 
HEK7 
HEK8 
HEK11 



HEK Receptor Antigens 

peptide Sfiauences 
CLETQSKNGPVPV 
CRAQMNQ I Q SVEV 
CMKVQLVNGMVPL ' ( , 
CMRTQMQQMHGRMVPV 
CQMLHLHGTGIQV 



Amino Acids in 
Fusion Protein 

22-159 
31-168 
335-545 
27-188 j 
187-503 



25 



30 



35 



EXAMPLE 5 

HEK/TrkB Chimeric Receptors 

1. Generation of pSJAl encoding rat trkB 
cytoplasmic domain. 

All of the chimeric receptors are composed of 
the extracellular domain and the transmembrane region of 
one of the HEK receptors and the intracellular portion 
of rat trkB. To simplify each individual construction, 
an intermediate or parental plasmid, called RtrkB/Aflll 
(or pSJAl), was generated. First, without altering the 
coded peptide sequence, an Aflll site (CTTAAG) was 
introduced into position 2021 (cytosine at position 2021 
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(C2021) to guanine at position 2026 (G2026, CTCAAG) of 
the rat trkB cDNA (Middlemas, et al., Mol. Cell. Biol. 
11, 143-153 (1991)) by PCR aided mutagenesis. Briefly, 
PCR primers were synthesized based on the rat trkB cDNA 
sequence. Primer I encompassed C2003 to G2034 of the 
cDNA. This primer contained two mutations, a cytosine 
to thymine (T) substitution at position 2023 (C2023T) and 
an insertion of an adenine (A) in between T2013 and 
G2014. These mutations created the Aflll site at 
position C2021 and an additional Xhol site flanking the 
Aflll site. Primer II was in the reverse direction 
encompassing T2141 to A2165 of the cDNA which bore an 
Apal site. The PCR fragment produced with these primers 
and the rat trkB cDNA template was digested with Xhol 
15 and Apal enzymes and sub cloned into the Xhol and Apal 
sites of an expression vector, pcDNA3 (InVitroGen) , to 
generate pSJAl-b. Following, pSJAl-b was linearized 
with Apal and ligated with a Banll digested rat trkB 
cDNA fragment (G2151 to G4697) to reconstitute a larger 
20 fragment (C2021 to G4697) including the coding sequence 
of the whole intracellular domain of the rat trkB 
protein (L442 to G790) and 1571 residues (A3131 to 
G4697) of the 1627 nucleotide 3 '-end non-coding region 
of the cDNA. 

25 2. Generation of HEK8/rat trkB (pSJA5) 

chimera . 

HEK8/rat trkB chimera was generated with a 
similar strategy as mentioned above. A Sall/Bsal cDNA 
fragment was first isolated from plasmid TK10/FL13. 
This fragment included the nucleotide sequence from the 
beginning to T1689 of the HEK8 cDNA (Figure 3) . Then, a 
pair of oligonucleotides was synthesized based on the 
HEK8 cDNA sequence. The sequence of the first 
oligonucleotide was the same as G1690 to CI 740 of the 
35 Hek8 cDNA, with an additional C residue added to its 3'- 
end. The second oligonucleotide was in the reverse 
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orientation of the HEK8 cDNA. It contained C1694 to 
C1740 of the HEK8 cDNA sequence and an additional five 
residue motif, TTAAG, at its 5 '-end. These two 
oligonucleotides were kinased and annealed with equal 
5 molar ratio, to create a double strand DNA fragment with 
the sequence of G1690 to C1740 of' the HEK8 cDNA and with 
the Bsal and the Aflll cohesive ends at its 5* and 3 1 
ends, respectively. This fragment was ligated together 
with-the- SaaiyB^sar~cDNA— fragment" into" Xhol / Af 1 1 1 
10 linearized pSJAl to generate the HEK8/RtrkB (pSJA5) 
chimerical construct. i 

3. Generation of HEKll/rat trkB (pSJA6) 

chimera . 

To generate the HEKll/rat trkB chimera, a 

15 Sall/AccI fragment covering the sequence of nucleotide 
CI to T1674 of the HEK11 cDNA (Figure 4) was first 
isolated from plasmid TK19T3. Then, a. pair of 
oligonucleotides was synthesized based on the HEK11 cDNA 
sequence. The first oligonucleotide had the same 

20 sequence as from nucleotide A1666 to T1691 of the HEK11 
cDNA, which contained the AccI site. The second 
oligonucleotide was in the reverse orientation of the 
HEK11 cDNA. It encompassed G1895 to T1919 of the HEK11 
cDNA sequence. An additional ten residue motif, 

25 CCCGCTTAAG, was added to the 5' -end of this 

oligonucleotide to introduce an Aflll site, which would 
be used to link the external domain and the 
transmembrane region of the HEK11 receptor to the 
intracellular domain of the rat trkB cDNA cloned in 

30 pSJAl in the same reading frame. PGR was performed with 
these oligonucleotides as primers and the HEK11 cDNA as 
template. The PCR fragment was digested with AccI and 
Aflll enzymes and ligated with the Sall/AccI cDNA 
fragment and the Xhol/Aflll linearized pSJAl to generate 

35 the HEKll/rat trkB (pSJA6) chimerical construct. 
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EXAMPLE 6 



Tissue Distribution of hrk Rg^pt-^ c 
5 the distribution of mRNA expression for HEK4, 

HEK5, HEK7, HEK8 and HEK11 receptors in human and rat 
tissues was examined by Northern blot hybridization. 

Rat total RNA was prepared from tissues using 
the method of Chomczynski and Sacchi (Anal. Biochem 1£Z, 
10 156-159 (1987)) . The RNA was separated by formaldehyde^ 
agarose electrophoresis and transferred to Hybond-N 
membranes (Amersham, Arlington Heights, IL) using 20X 
SSC (Maniatis et al. 1982). The membrane was dried at 
80OC in vacuo for 30 minutes, then crosslinked for 3 
minutes ,on a UV transilluminator (Fotodyne, New Berlin, 
WI) . The membrane was prehybridized for 2 hours at 42°C 
in 50% formamide, 5X SSPE, 5X Denhardfs, 0.2% SDS, and 
100 jig/ml denatured herring sperm DNA (Maniatis et al. 
1982) . Northern blots of human' tissue were purchased 
from Clontech (Palo Alto, CA) . Probes were prepared by, 
labeling the fragment of cDNA which encoded the 
extracellular domain of the receptor with 32 P . dCTp using 
a hexanucleotide random priming kit (Boehringer 
Mannheim, Indianapolis, IN) to a specific activity of at 
25 least 1x109 cpm/ug. The probe was hybridized to the 

membrane at a concentration of 1-5 ng/ml at 42Q C for 24 
to 36 hours in a buffer similar to the prehybridizatlon 
buffer except that IX Denhardfs was used. After 
hybridization, the membranes were washed 2 times for 5 
30 minutes each in 2X SSC, 0.1% SDS at room temperature 
followed by two 15 minute washes in 0.5X SSC, 0.1% SDS 
at 550C. Blots were exposed for 1-2 weeks using Kodak 
XAR film (Kodak, Rochester, NY) with a Dupont Lightning 
Plus intensifying screen. The results are shown in 
35 Figures 7-11. 
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Homologs for HEK4 have been previously 
identified from mouse, chicken, and rat. In the adult 
mouse, expression is detected primarily in the brain and 
testis, (Sajjadi et al. 1991), A slightly different 
5 pattern was found in adult chicken tissues, with the 
main sources of expression being the brain, liver, and 
kidney. Lower levels of expression were detectable in 
the lung and heart (Marcelle & Eichmann, Oncogene J7, 
2479-248? (1992)). A fragment of the Rek4 gene (tyro-4) 
10 has been isolated and used to look at tissue expression 
in the adult rat (Sajjadi et al. 1991) . The brain was 
the only tissue that expressed Rek4 mRNA. However, RNA 

from lung or testis were not examined. Previous studies 

i 

on HEK4 only looked at the expression of the mRNA in 

15 cell lines, where it was found in one pre-B cell line 
and two T-cell lines (Wicks et al; 1992) . The 
significance of this with regard to in vivo expression 
remains to be determined. In this study we have looked 
at the HEK4 expression in human tissues, and also the 

20 expression of Rek4 in rat tissues. The HBK4 mRNA 

corresponds to a single transcript with a size of about 
7 kb (Fig 7A) . HEK4 mRNA was most abundantly expressed 
in placenta, with lower levels present in heart, brain, 
lung, and liver. On prolonged exposures, trace amounts 

25 of mRNA were detectable in kidney and pancreas. 

Expression in the rat was more similar to that detected 
in the mouse and chicken. Rek4 was expressed at the 
lowest levels of any of the family members characterized 
herein. A transcript of about 7 kb was detectable in 

30 rat lung, with a lower amount detectable in brain (Fig. 
7B) . Also, a 4 kb transcript was expressed in rat 
testis. Because the transcripts were barely detectable 
using total RNA, some of the other rat tissues may 
contain amounts of Rek4 below the level of detection. 
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The expression of HEK5 in adult tissues has 
been previously studied in chicken and rat. Studies in 
the chicken have identified the Cek5 protein in the 
brain and liver, with a smaller protein detected in the 
5 intestine.* In the rat, the tyro-5 fragment detected 
ftiRNA expression only in the adult brain, though 
intestine was not examined (La;L and Lemke, 1991) . Our 
results show that HEK5 mRNA was expressed at much higher 
levels than HEK4 and was found as transcripts of several 

10 sizes. The most abundant mRNAs were of approximately 

4.0 and 4.4 kb, with lesser amounts of higher molecular 
weight transcripts of 9.5 kb and longer (Fig. 8A) . The 
HEK5 mRNA was" most abundantly expressed in placenta, but 
was also highly expressed in brain, pancreas, kidney, 

15 muscle, and lung. Longer exposures of the blots 

revealed the presence of transcripts in heart and liver 
as well. The rat homolog of HEK5 (RekS) showed a 
somewhat similar pattern of expression. RekS was most 
abundant in intestine, followed by brain, kidney, lung, 

20 thymus, stomach, and ovary (Fig. 8B) . Expression was 
not detectable in testis, muscle, heart, or liver. 
During our analysis of this family, we concluded that 
the rat Erk fragment (Chan & Watt, 1991) likely encodes 
a portion of the RekS receptor. Erk expression was 

25 examined in several rat tissues and found only in the 
lung. The reason for the discrepancy between that 
report and what we and others (Lai & Lemke, 1991) have 
found is unclear. 

30 Homologs for HEK8 have been identified from 

chicken, mouse, and rat. In the adult chicken, a single 
Cek8 transcript was found to be expressed at high levels 
in the brain, with expression also detected in the 
kidney, lung, muscle, and thymus. The expression of the 

35 mouse homolog of HEK8, Sek, has been detected as a 

single transcript with abundant expression in the adult 
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brain and lower expression in the heart, lung and 
kidney, A fragment of Rek8 (tyro-1) was used to look at 
expression in rat tissues , with expression found only in 
the brain j (Lai & Lemke, 199 ! 1) . We fpund that HEK8 mRNA 
5 was expressed at levels comparable to that of HEK5. 
Multiple transcripts were also observed, the most 
abundant at 7 kb and 5 kb. The highest level of mRNA 
expression was seen in* the brain, although substantial 
levels urere detected in other tissues including heart /f 

10 lung, muscle, kidney, placenta, and pancreas. 

Expression in liver was .much lower than in the other 
tissues. The only difference in expression patterns ? 
between human and mouse was expression in human muscle, 
also seen for Cek8 in chicken. ' Among the rat tissues, 

15 Rek8 was most highly expressed in the brain, followed by 
the lung, heart, and testis (Fig. 10B) . In contrast to 
HEK8, expression of Rek8 appeared to be lower in muscle 
and kidney, two tissues where l HEK8 was readily 
detectable. In addition, Rek8 was not expressed as a 

20 5.0 kb transcript, as it was not visible even on 
prolonged exposures. 

During the analysis of this family, we deduced 
that HEK7 is the human homolog of Cek7. The only 

25 expression seen in adult chicken was an 8.5 kb 

transcript found in the brain (Sajjadi & Pasquale, 
1993) . Of the five EPH sub- family members described 
- here, HEK7 was the most restricted in its expression 
pattern. Analysis of human mRNA revealed significant 

30 expression only in the brain, with a much lower level 
detectable in the placenta (Fig. 9A) . Prolonged 
exposures did not reveal expression in any other tissue 
examined. Two prominent transcripts were found in 
brain, the most highly expressed with a size of 6 kb and 

35 the other with a length of 9 kb. In the placenta, 

however, only the 9 kb transcript was detected. Rek7 
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mRNA was expressed in a pattern similar to HEK7. The 
highest level of expression was found in brain, with a 
much lower level in ovary (Fig. SB) . The transcripts 
were of similar size as for HEK7, with the 6 kb 
transcript detected only in brain. 

HEK11 was expressed ,as several transcripts, 
with major mRNAs of length 7.5, 6.0 and 3.0 kb and minor 
trans cripts of 4 J 4_Md2_.4 Jeb (Fig. 11A) . All five 
mRNAs were expressed at the highest levels in brain, 
followed by heart. Placenta, lung and kidney had 
significant amounts of four of the five transcripts, 
with lower expression seen in muscle. Pancreas had 
barely detectable amounts of HEK11 mRNA, while liver had 
15 no detectable HEKl!l transcript. Rekll had a similar 
pattern of expression, with four transcripts (10, 7.5, 
3.5 and 3.0 kb) detected in brain (Fig. 11B) . 



10 



20 



The relative level of mRNA expression for each 
of the five receptors in all tissues studied' is 
summarized in Table 5. 
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TABLE 5 

Tissue Distribution of ,HEK Receptors 



Uiitnan 




HEK5 


HEK7 


HEK8 


HEK11 


Brain 




++ 


++ 


+++ 


++ 


Heart 


+ 


+ 


be* 


++ 


+ 


Kidney 


+ 


+ 


bd 


+ 


+ 


Liver 


+ 


+ 


bd 


+ 


bd 


Lung 


+ 


+ 


bd 


++ 

! 


+ 


Muscle 


+ 


+ 


bd 


++ 


+ 


Pancreas 




++ 


bd 


+ 


bd 


Placenta 


+++ 


f 

+++ 


bd 


++ 


+ 


Rat 


HEK4 


HEK5 


HEK7 


HEK8 


HEKll 


Brain 


+ 


++ 


+++ 


+++ 


++ 


Heart 


bd 


i bd 


bd 


. + 

i 


bd 


intestine 


bd 


+++ 


bd 


bd 


bd 


Kidney 


bd 


++ 


bd 


bd 


bd 


Liver 


bd 


bd 


bd 


bd 


bd 


Lung 


+ 


+ 


bd 


++ 


bd 


Muscle 


bd 


bd 


bd 


bd 


bd 


Ovary 


bd 


+ 


+ 


bd 


bd 


Stomach 


bd 


+ 


bd 


bd 


bd 


Testis 


+ 


bd 


bd 


+ 


bd 


Thymus 


bd 


+ 


bd 


bd 


bd 



bd= below detection 
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The transcripts for HEKs 4,5,8, and 11 were 
rather widely distributed in human tissue while HEK7 was 
specific for brain. Expression patterns between rat and 
human tissue were roughly comparable given that the rat 
blots were less sensitive due tq the' use of total RNA 
rather than polyA+. As was found for the Cek mRNAs by 
Sajjadi and Pasquale (Sajjadi & Pasquale, 1993), often 
there were several different size transcripts detected 
for a single receptor. The size distribution of the 
transcripts appears to be both tissue and species 
specific. Previous work has shown that the smaller 
transcript of Mek4 encodes a potentially secreted 
receptor (Sajjadi et al. 1991). 

, The following sections describe Materials and 
Methods used to carry out experiments described in 
Example 1. 

Isolation, Cloning and sequencing nf ^ K rs^or nnKA B 
20 Fragments containing a portion of the , 

catalytic domain of EPH sub-family receptors were 
generated using a polymerase chain reaction (PCR) with 
disrupted phage from a human fetal brain cDNA library as 
a template. A 10*11 aliquot of the cDNA library 
(Stratagene, La Jolla, CA) was treated at 70°C for 5 
minutes to disrupt the phage particles, then cooled on 
wet ice. The disrupted phage were added to 10*11 of 10X 
Im polymerase buffer, 8ul of 2mM each dNTP, 100 
picomoles of each primer, and 1.5 Hi of Tag polymerase 
(Promega, Madison, WI) in a total volume of 100*11. The 
reaction was run for 35 cycles, each consisting of 1 
minute at 96°C, 1 minute at 50°C, and 2 minutes at 72©c. 
A 5 minute, 72°C incubation was added at the end to 
ensure complete extension. The primers used were 
35 degenerate mixtures of oligonucleotides based on amino 



25 



30 
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acid sequences which are highly conserved among EPH 
sub-family members. 

5 1 AGGGAATTCCAYCGNGAYYTNGCNG6 1 (SEQ ID NO: 27); s 
5 5 1 AGGGGATCCRWARSWCCANACRTC 1 ( SEQ ID NO: 28). 

The 'products of the PGR reaction were digested 
with EcoRI and BamHI and cldned into M13mpl9 (Messing, 
Methods-En^ ymol . (1983) ) -for sequence analysis. The , 

10 five clones which were identified as fragments of EPH 
receptor sub-family members were labeled with 3 2 P-dCTP 
by__random priming ancLeach, was „used__to^creen Genescreen 
nitrocellulose filters (NEN, Boston, MA) containing 
plaques from the human fetal brain cDNA library. Phage 

15 stocks prepared from positively screening plaques were 
plated and rescreened with the same probe in order to 
obtain single clones. cDNA inserts were transferred 
into pBluescript using the in vAvo excision protocol 
supplied with the cDNA library (Stratagene, La Jolla, 

20 CA) . Nucleotide sequences were determined using Taq 

DyeDeoxy Terminator Cycle Sequencing kits and an Applied 
Biosystems 373A automated DNA sequencer (Applied 
Biosystems, Foster City, CA) . 



25 5' Race 

The 5' ends of the cDNAs were isolated using a 
5 1 RACE kit (GIBCO/BRL, Gaithersburg, MD) following the 
manufacturers instructions. Excess primers were 
removed after first strand cDNA synthesis using 

30 ultrafree-MC cellulose filters (30,000 molecular weight 
cutoff, Millipore, Bedford, MA). Amplified PCR products, 
were digested with the appropriate restriction enzymes, 
separated by agarose gel electrophoresis, and purified 
using a Geneclean kit (BiolOl, La Jolla, CA) . The 

35 purified PCR product was ligated into the plasmid vector 
pUCl9 (Yanisch-Perron et al. Gene 103-119 (1985)) 
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which had been digested with appropriate restriction 
enzymes and the ligation mixture was introduced into 
host bacteria by electroporation. ' Plasmid DNA was 
prepared from the resulting colonies. Those clones with 
the largest inserts were selected for DNA sequencing. 



While the present invention has been described 
10 in terms of preferred embodiments, it is understood that 
variations and modifications will occur to those skilled 
in the art. Therefore, it is intended that' the appended 
claims cover "all such equivalent variations which come 
within the scope of the invention as claimed. 
15 ' * 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 

(ii) TITLE OF INVENTION: EPH-Like Receptor Protein Tyrosine 
Kinases 1 

t 

(iii) NUMBER OF SEQUENCES: 2B 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Patent Operations /RBW 

(B) STREET: 1840 Dehavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California , 

(E) COUNTRY: USA 

(F) ZIP: 91320 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Winter, Robert B. 
(C) REFERENCE/DOCKET NUMBER: A-287 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : linear 

' (ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Trp Thr Ala Pro Glu Ala lie 
1 5 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single , 

(D) i TOPOLOGY: linear , * 

i 

(ii) MOLECULE TYPE: protein 

! 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Val Cys Lys- Val Ser Asp Phe Gly 
1 5 

(2) INFORMATION FOR SEQ ' ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids " ' 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single ' 

(D) TOPOLOGY: linear 

i 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:i: 

i 

Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Tyr Leu Gin Asp Asp t 
1 5 " 15 1 

Thr Ser Asp Pro Thr Tyr Thr Ser Ser Leu Gly Gly Lys He Pro Val 
20 25 30 

Arg Trp Thr Ala Pro Glu Ala He 
35 40 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu Glu Asp Asp 
5 10 15 



I 
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Pro Glu Ala Ala Tyr Thr Thr Arg Gly Gly Lys lie Pro lie Arg Trp 

20 x 25 • 30 

Thr Ala Pro Glu Ala He 
35 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Phe Leu Glu Asp J^sp 
1 5 10 .15 

Thr Ser Asp Pro Thr Tyr Thr Ser Ala Leu Gly Gly Lys lie Pro lie 
20 25 30 

Arg Trp Thr Ala Pro Glu Ala lie 
35 40 

! 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Cys Lys Val Ser Asp Phe Gly Met Ser Arg Val Leu Glu Asp Asp 
1 5 .10 15 

Pro Glu Ala Ala Tyr Thr Thr Arg Gly Gly Lys He Pro He Arg Trp 
20 25 30 

Thr Ala Pro Glu Ala He 
35 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single , 

(D) i TOPpLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



PCT/US95/04681 

i 



Val' Cys Lys Val Ser Asp Phe Gly I^fi-SerArg Vai ll£-Glu Asp Asp 
1 5 10 15 

Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys lie Pro Val Arc Trp 
20 25 30 

Thr Ala Pro Glu Ala lie 1 
35 

l 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 1 

! 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Cys Lys Val Ser Asp Phe Gly Leu Ala Arg Leu He Glu Asp Asn 

Glu Tyr Thr Ala Arg Gin Gly Ala Lys Phe Pro He Lys Trp Thr Ala 
20 25 30 

Pro Glu Ala He 
35 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Val Cys Lys Val Ser Asp Phe Gly Leu Ala Arg Asp lie Met Arg Asp 
1 5 , 10 15 

Ser Asn Tyr lie Ser Lys Gly Ser Thr Phe Leu Pro Leu Lys Trp Thr 
20 25 ( 30 

Ala Pro Glu Ala He 

35 , 

(2) INFORMATION FOR SEQ ID NO: 10:, f 

(i) SEQUENCp CHARACTERISTICS: 

(A) LENGTH: 2962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

v 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATUIjtE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2913 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTG CTC GCC GCC GTG GAA GAA ACG CTA ATG GAc'tCC ACT ACA GCG ACT 48 
Leu Leu Ala Ala Val Glu Glu Thr Leu Met Asp Ser Thr Thr Ala Thr , 
1 5 10 15 

GCT GAG CTG GGC TGG ATG GTG CAT CCT CCA TCA GGG TGG GAA GAG GTG 96 
Ala Glu Leu Gly Trp Met Val His Pro Pro Ser Gly Trp Glu Glu Val 
20 25 30 

AGT GGC TAC GAT GAG AAC ATG AAC ACG ATC CGC ACG TAC CAG GTG TGC 144 
Ser Gly Tyr Asp Glu Asn Met Asn Thr He Arg Thr Tyr Gin Val Cys 



35 



40 45 



AAC GTG TTT GAG TCA AGC CAG AAC AAC TGG CTA CGG ACC AAG TTT ATC 192 
Asn Val Phe Glu Ser Ser Gin Asn Asn Trp Leu Arg Thr Lys Phe He 

55 60 



50 



CGG CGC CGT GGG GCC CAC CGC ATC CAC GTG GAG ATG AAG TTT TCG GTG 
Arg Arg Arg Gly Ala His Arg He His Val Glu Met Lys Phe Ser Val 

70 75 »0 



65 



240 



CGT GAC TGC AGC AGC ATC CCC AGC GTG CCT GGC TCC TGC AAG GAG ACC 288 
Arg Asp Cys Ser Ser He Pro Ser Val Pro Gly Ser Cys Lys Glu Thr 
85 90 95 

TTC AAC CTC TAT TAC TAT GAG GCT GAC TTT GAC TCG GCC ACC AAG ACC 336 
Phe Asn Leu Tyr Tyr Tyr Glu Ala Asp Phe Asp Ser Ala Thr Lys Thr 
100 105 HO 



BNSDOCID: <WO 9528484 A 1J_> 



WO 95/28484 

PCT/US9S/04681 

- 42 - 



^ ATG GAG ** T CCA TGG GTG ™ G GTG GA * ACC ATT GCA 
Phe Pro Asn Trp Met Glu Asn Pro Trp Val Lys val Asp Thr lie Ala 

115 120 125 

GCC GAC GAG AGC TTC TCC CAG GTG GAC CTG GGT GGC CGC GTC ATG AAA 
Ala Asp Glu Ser Phe Ser Gin Val Asp Leu Gly Gly Arg vS £? J£ 

135 

r? C ^° ™v C ^ GTG CGG AGC TTC CCT GTG TCC CGC AGC GGC TTC 

lie Asn Thr Glu Val Arg Ser Phe Gly Pro Val Ser Arg Ser Gly Phe 

0 155 i6 0 

TAC CTG GCC TTC CAG GAC TAT GGC GGC TGC ATG TCC CTC ATC GCC GTG 
Tyr Leu Ala Phe Gin Asp Tyr Gly Gly Cys Met Ser Leu S Sa vlt 
165 170 175 

2£ 2? 2° f GC **° TGC CCC CGC ATC ATC CAG GGC GCC ATC 

Arg val Phe Tyr Arg Lys Cys Pro Arg lie n e Gin Asn Gly Ala ill 

v 185 190 

^ AG ^ ™ CC CTG TCG 666 GCT GAG AGC A CA TCG CTG GTG GCT GCC 
Phe Gin Glu Thr Leu Ser Gly Ala Glu Ser Thr Ser Leu vlt Sa £a 

CGG GGC AGC TGC ATC GCC AAT GCG GAA GAG GTG GAT GTA CCC ATC Air 
Arg Gly Ser Cys He Ala Asn Ala Glu Glu Val Asp vlt J£ £ G 

215 220 

CTC TAC TGT AAC GGG GAC GGC GAG TGG CTG GTG CCC ATC GGT rrn n«o 

225 ^ * — G1 » Jg "X *P S K s s s; 

235 240 

ATG TGC AAA GCA GGC TTC GAG GCC GTT GAG AAT GGC ACC GTC TGC CGA 
Met Cys Lys Ala Gly Phe Glu Ala Val Glu Asn Gly tS vlt 2* 

250 255 

265 270 

?° o GT f CC AT ° ^ AGC CGG ACC ACT T CT GAA GGG GCC ACC AAC 
Thr H.s Cys Pro He Asn Ser Arg Thr Thr Ser Glu Gly 2a 

s 280 285 

TGT GTC TGC CGC AAT GGC TAC TAC AGA GCA GAC CTG GAC CCC CTC ran 
Cys Val Cys Arg Asn Gly Tyr Tyr Arg Ala Asp Leu £p £o £u £p 

295 300 

J! 6 f CC J GC ACA ACC ATC CC C TCC GCG CCC CAG GCT GTG ATT TCC AGT 
Met Pro Cys Thr Thr lie Pro Ser Ala Pro Gin Ala Val S sSr Ser 

315 320 

5? i** ^ A 5° TCC CTC ATG CTG GAG TGG ACC CCT CCC CGC GAC TCC 
Val Asn Glu Thr Ser Leu Met Leu Glu Trp Thr Pro Pro Arg Sp Ser 

325 330 * 33 f 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 
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GGA GGC CGA GAG GAC CTC GTC TAC AAC ATC ATC TGC AAG AGC TGT GGC 
Gly Gly Arg Glu Asp Leu Val Tyr Asn lie He Cys Lys Ser Cys Gly 
340 345 • 350 



1056 



TCG GGC CGG GGT GCC TGC ACC CGC TGC GGG GAC AAT GTA CAG TAC GCA 
Ser Gly Arg Gly Ala Cys Thr Arg Cys Gly Asp Asn Val Gin Tyr Ala 
355 360 365 



1104 



CCA CGC CAG CTA GGC CTG ACC GAG CCA CGC ATT TAC ATC AGT GAC CTG 
Pro Arg Gin Leu Gly Leu Thr Glu Pro Arg lie Tyr He Ser Asp Leu 
370 375 380 



1152 



CTG GCC CAC ACC CAG TAC ACC TTC GAG ATC CAG GCT GTG AAC GGC GTT 
Leu Ala His- Thr-Gln- Tyr Thr Phe Glu He Gin Ala Val Asn Gly Val 
385 390 395 400 



1200 



ACT GAC CAG AGC CCC TTC TCG CCT CAG TTC GCC TCT GTG AAC £TC ACC 
Thr Asp Gin Ser Pro Phe Ser Pro Gin Phe Ala Ser Val Asn He Thr 
405 410 415 



1248 



ACC AAC CAG GCA GQT CCA TCG GCA GTG TCC ATC ATG CAT CAG GTG AGC 
Thr Asn Gin Ala Ala Pro Ser Ala Val Ser He Met His Gin Val Ser 
420 425 430 



1296 



CGC ACC GTG GAC AGC ATT ACC CTG TCG TGG TCC CAG CCG GAC CAG CCC 
Arg Thr Val Asp Ser He Thr Leu Ser Trp Ser Gin Pro Asp Gin Pro 
435 440 445 



1344 



AAT GGC GTG ATC CTG GAC TAT GAG CTG CAG TAC TAT GAG AAG GAG CTC 
Asn Gly Val He Leu Asp Tyr felu Leu Gin Tyr' Tyr Glu Lys Glu Leu 
450 455 460 



1392 



AGT GAG TAC AAC GCC ACA GCC ATA AAA AGC CCC ACC AAC ACG GTC ACG 
Ser Glu Tyr Asn Ala Thr Ala He Lys Ser Pro Thr Asn Thr Val Thr 
465 470 475 480 



1440 



GGC CTC AAA GCC GGC GCC ATC TAT GTC TTC CAG GTG CGG GCA CGC ACT 
Gly Leu Lys Ala Gly Ala He Tyr Val Phe Gin Val Arg Ala Arg Thr 
485 490 495 



1488 



GTG GCA GGC TAC GGG CGC TAC AGC GGC AAG ATG TAC TTC CAG ACC ATG 
Val Ala Gly Tyr Gly Arg Tyr Ser Gly Lys Met Tyr Phe Gin Thr Met 
500 505 510 



1536 



ACA GAA GCC GAG TAC CAG ACA AGC ATC CAG GAG AAG TTG CCA CTC ATC 
Thr Glu Ala Glu Tyr Gin Thr Ser He Gin Glu Lys Leu Pro Leu He 
515 520 525 



1584 



ATC GGC TCC TCG GCC GCT GGC CTG GTC TTC CTC ATT GCT GTG GTT GTC 
He Gly Ser Ser Ala Ala Gly Leu Val Phe Leu He Ala Val Val Val 
530 535 540 



1632 



ATC GCC ATC GTG TGT AAC AGA CGG GGG TTT GAG CGT GCT GAC TCG GAG 
lie Ala lie Val Cys Asn Arg Arg Gly Phe Glu Arg Ala Asp Ser Glu 
545 550 555 560 



1680 
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TAC ACG GAC AAG CTG CAA CAC TAC ACC AGT GGC CAC ATA ACC CCA GGC 1728 
Tyr Thr Asp Lys Leu Gin His Tyr Thr Ser Gly His He Thr Pro Gly 
565 570 575 

ATG AAG ATC TAC ATC GAT ■ CCT TTC ACC TAC GAG GAC CCC AAC GAG GCA 1 1776 

Met Lys He Tyr He Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu Ala 

580 , 585 590 * 

i 

GTG CGG GAG TTT GCC AAG GAA ATT GAC ATC TCC TGT GTC AAA ATT GAG 1824 
Val Arg Glu Phe Ala Lys Glu He Asp He Ser Cys Val Lys He Glu 
595 .600 605 

CAG GTG ATC GGA GCA GGG GAG TTT GGC GAG GTC TGC AGT GGC CAC CTG 1872 
Gin Val He Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly His Leu 
610 ' 615 620 

AAG CTG CCA GGC AAG AGA GAG ATC TTT GTG GCC ATC AAG ACG CTC AAG 1920 
Lys Leu Pro Gly Lys Arg Glu He Phe? Val Ala He Lys Thr Leu Lys 
625 630 635 640 

TCG GGC TAC ACG GAG AAG CAG CGC CGG GAC TTC CTG AGC GAA GCC TCC 
Ser Gly Tyr Thr Glu Lys Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser 
645 650 ' 655 

ATC ATG GGC CAG TTC GAC CAT CCC AAC GTC ATC CAC CTG GAG GGT GTC 2016 
He Met Gly Gin Phe Asp His Pro Asn Val He His Leu Glu Gly Val 
660 665 670 

GTG ACC AAG AGC ACA CCT GTG ATG ATC ATC ACC GAG TTC ATG GAG AAT 2064 
Val Thr Lys Ser Thr Pro Val Met He He Thr , Glu Phe Met Glu Asn 
675 680 1 685 

GGC TCC CTG GAC TCC TTT CTC CGG CAA AAC GAT GGG CAG TTC ACA GTC 2112 
Gly Ser Leu Asp Ser Phe Leu Arg Gin Asn Asp Gly Gin Phe Thr Val 
690 695 700 

ATC CAG CTG GTG GGC ATG CTT CGG GGC ATC GCA GCT GGC ATG AAG TAC 2160 
He Gin Leu Val Gly Met Leu Arg Gly He Ala Ala Gly Met Lys Tyr 
705 7 " 715 720 

CTG GCA GAC ATG AAC TAT GTT CAC CGT GAC CTG GCT GCC CGC AAC ATC 2208 
Leu Ala Asp Met Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 
725 730 735 

CTC GTC AAC AGC AAC CTG GTC TGC AAG GTG TCG GAC TTT GGG CTC TCA 2256 
Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser 
7 40 745 750 

CGC TTT CTA GAG GAC GAT ACC TCA GAC CCC ACC TAC ACC AGT GCC CTG 2304 
Arg Phe Leu Glu Asp Asp Thr Ser Asp Pro Thr Tyr Thr Ser Ala Leu 
755 760 765 

GGC GGA AAG TTC CCC ATC CGC TGG ACA GCC CCG GAA GCC ATC CAG TAC 2352 
Gly Gly Lys Phe Pro He Arg Trp Thr Ala Pro Glu Ala He Gin Tvr 
770 775 780 
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CGG AAG TTC ACC TCG GCC AGT GAT GTG TGG AGC TAC GGC ATT GTC ATG 
Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly lie Val Met 
785 790 795 800 

TGG GAG GTG ATG TCC TAT GGG GAG CGG CCCp TAC TGG GAC ATG ACC AAC 
Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Thr Asn 



805 



810 



815 



2400 



2448 



CAG GAT GTA ATC AAT GCC ATT GAG CAG GAC TAT CGG CTG CCA CCG CCC 
Gin Asp Val lie Asn Ala lie Glu Gin Asp Tyr Arg Leu Pro Pro Pro 

820 ' 825 830 

I 

i i 

ATG GAC TGC CCG AGC GCC CTG CAC CAA CTC ATG CTG GAC TGT TGG CAG 
Met Asp-eys— Pro Ser— Ala— Leu His Gin Leu Met Leu Asp Cys Trp Gin 
835 840 845 



2496 



2544 



AAG GAC CGC AAC CAC CGG' CCC AAG TTC GGC CAA ATT GTC AAC ACG CTA 2592 
Lys Asp Arg Asn His Arg Pro Lys Phe Gly Gin He Val Asn Thr Leu 
850 855 860 , 

i 

GAC AAG ATG ATC CGC AAT CCC AAC AGC CTC AAA GCC ATG GCG CCC CTC 2640 
Asp Lys Met lie Arg Asn Pro Asn Ser Leu Lys' Ala Met Ala Pro Leu 
865 ( 870 875 880 

TCC TCT GGC ATC AAC CTG CCG CTG CTG GAC CGC ACG ATC CCC GAC TAC 2688 
Ser Ser Gly He Asn Leu Pro Leu Leu Asp Arg Thr He Pro Asp Tyr 
885 890 895 



ACC AGC TTT AAC ACG GTG GAC GAG TGG CTG GAG , GCC ATC AAG ATG GGG 
Thr Ser Phe Asn Thr Val Asp Glu Trp Leu Glu 1 Ala He Lys Met Gly 
900 905 910 



2736 



CAG TAC AAG GAG AGC TTC GCC AAT GCC GGC TTC ACC TCC TTT GAC GTC 
Gin Tyr Lys Glu Ser Phe Ala Asn Ala Gly Phe Thr Ser Phe Asp Val 
915 920 925 



2784 



GTG TCT CAG ATG ATG ATG GAG GAC ATT CTC CGG GTT GGG GTC ACT TTG 
Val Ser Gin Met Met Met Glu Asp He Leu Arg Val Gly Val Thr Leu 
930 935 940 



2832 



GCT GGC CAC CAG AAA AAA ATC CTG AAC AGT ATC CAG GTG ATG CGG GCG 
Ala Gly His Gin Lys Lys He Leu Asn Ser He Gin Val Met Arg Ala 
945 950 955 960 



2880 



CAG ATG AAC CAG ATT CAG TCT GTG GAG GTT TGACATTCAC CTGCCTCGGC 
Gin Met Asn Gin He Gin Ser Val Glu Val 
965 970 



2930 



TCACCTCTTC CTCCAAGCCC CGCCCCCTCT GC 



2962 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO ill: 

lieu Leu Ala Ala Val Glu Glu Thr Leu Met Asp Ser Thr Thr Ala Thr 
1 5 10 15 

Ala Glu Leu Gly tfrp Met Val His Pro Pro Ser Gly Trp Glu Glu Val 
20 25 30 

i 

Ser Gly Tyr Asp Glu Asn Met Asn Thr lie J^rg Thr Tyr Gin Val Cys 
35 40 45 

Asn Val Phe Glu Ser Ser Gin Asn Asn Trp Leu Arg Thr Lys Phe lie 
50 55 €0 

Arg Arg Arg Gly Ala His Arg He His Val Glu Met Lys Phe Ser Val 
65 70 75 ( 80 

Arg Asp Cys Ser Ser He Pro Ser Val Pro Gly Ser Cys Lys Glu Thr 
85 90 95 

Phe Asn Leu Tyr Tyr Tyr Glu Ala Asp Phe Asp Ser Ala Thr Lys Thr 
100 105 110 

Phe Pro Asn Trp Met Glu Asn Pro Trp Val Lys Val Asp Thr lie Ala 
115 120 125 

Ala Asp Glu Ser Phe Ser Gin Val Asp Leu Gly Gly Arg Val Met Lys 
130 135 140 

I 

He Asn Thr Glu Val Arg Ser Phe Gly Pro Val Ser Arg Ser (Sly Phe 
145 150 155 160 

Tyr Leu Ala Phe Gin Asp Tyr Gly Gly Cys Met Ser Leu lie Ala Val 
165 170 175 

Arg Val Phe Tyr Arg Lys Cys Pro Arg lie He Gin Asn Gly Ala He 
180 185 190 

Phe Gin Glu Thr Leu Ser Gly Ala Glu Ser Thr Ser Leu Val Ala Ala 
195 200 205 

Arg Gly Ser Cys He Ala Asn Ala Glu Glu Val Asp Val Pro He Lys 
210 215 220 

Leu Tyr Cys Asn Gly Asp Gly Glu Trp Leu Val Pro He Gly Arg Cys 
225 230 235 240 

Met Cys Lys Ala Gly Phe Glu Ala Val Glu Asn Gly Thr Val Cys Arg 
245 250 255 

Gly Cys Pro Ser Gly Thr Phe Lys Ala Asn Gin Gly Asp Glu Ala Cys 
260 265 270 

Thr His Cys Pro He Asn Ser Arg Thr Thr Ser Glu Gly Ala Thr Asn 
275 280 285 
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Cys Val Cys Arg Asn Gly Tyr Tyr Arg Ala Asp Leu Asp Pro Leu Asp 
290 295 300 

Met Pro Cys Thr Thr lie Pro Ser Ala Pro Gin Ala Val lie Ser Ser 
305 310 , 315 320^ 

Val Asn Glu Thr ber Leu Met Leu Glu Trp Thr Pro Pro Arg Asp Ser 
325 330 335 

Gly Gly Arg Glu Asp Leu Val Tyr Asn lie lie Cys Lys Ser Cys Gly 
340 345 350 

Ser Gly Arg Gly Ala Cys Thr Arg Cys Gly Asp Asn Val Gin Tyr Ala 
355 f 360 365 

Pro Arg Gin Leu Gly Leu Thr Glu Pro Arg lie Tyr lie Ser Asp Leu 
370 ' 375 380 

Leu Ala His Thr Gin Tyr Thr Phe Glu lie Gin. Ala Val, Asn Gly Val 
385 390 395 400 

Thr Asp Gin Ser Pro Phe Ser Pro, Gin Phe Aia Ser Val Asn He Thr 
, 405 410 415 

Thr Asn Gin Ala Ala Pro Ser Ala Vai Ser lie Met His Gin Val Ser 
420 425 430 

Arg Thr Val Asp Ser He Thr Leu Ser Trp Ser Gin Pro Asp Gin Pro 

435 440 1 , 445 

* i 

Asn Gly Val lie Leu Asp Tyr Glu Leu Gin Tyr Tyr Glu Lys Glu Leu 
450 455 460 

Ser Glu Tyr Asn Ala Thr Ala He Lys Ser Pro Thr Asn Thr Val Thr 
465 470 475 480 

Gly Leu Lys Ala Gly Ala He Tyr Val Phe Gin Val Arg Ala Arg Thr 
485 490 495 

Val Ala Gly Tyr Gly Arg Tyr Ser. Gly Lys Met Tyr Phe Gin Thr Met 
500 505 . 510 

Thr Glu Ala Glu Tyr Gin Thr Ser He Gin Glu Lys Leu Pro Leu He 
515 520 525 

He Gly Ser Ser Ala Ala Gly Leu Val Phe Leu He Ala Val Val Val 
530 535 540 

He Ala He Val Cys Asn Arg Arg Gly Phe Glu Arg Ala Asp Ser Glu 
545 550 555 560 

Tvr Thr Asp Lys Leu Gin His Tyr Thr Ser Gly His He Thr Pro Gly 
565 570 575 

Met Lys He Tyr He Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu Ala 
580 585 590 
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Val Arg Glu Phe Ala Lys Glu lie Asp lie Ser Cys Val Lys He Glu 
595 600 605 

Gin Val lie Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly His Leu 
610 615 620 

Lys Leu Pro Gly Lys Arg Glu He Phe Val Ala He Lys Thr Leu Lys 
625 630 635 640 

Ser Gly Tyr Thr Glu Lys Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser 
645 650 i 655 

He Met Gly Gin Phe Asp His Pro Asn Val He His Leu Glu Gly Val 
6 .6P . 665 670 



Val Thr Lys Ser Thr Pro Val Met He He Thr Glu Phe Met Glu Asn 
675 680 685 

Gly LSU **** Ser Phe Leu Ar 9 Gln Asn Asp Gly Gin Phe Thr Val 
690 695 700 

He Gin Leu Val Gly Met Leu Arg Gly He Ala Ala Gly Met Lys Tyr 
/05 710 ' 715 720 • 

Leu Ala Asp Met Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 
725 730 735 

Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser 
740 745 750 

Arg Phe Leu Glu Asp Asp Thr 'ser Asp Pro Thr Tyr Thr Ser Ala Leu 
755 760 765 

Gly Gly Lys Phe Pro He Arg Trp Thr Ala Pro Glu Ala He Gin Tyr 
770 775 780 

Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly He Val Met 
785 79 ° 7 95 soo 

Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Thr Asn 
805 810 815 

Gin Asp Val He Asn Ala He Glu Gin Asp Tyr Arg Leu Pro Pro Pro 
820 825 830 

Met Asp Cys Pro Ser Ala Leu His Gin Leu Met Leu Asp Cys Trp Gin 
835 840 845 

Lys Asp Arg Asn His Arg Pro Lys Phe Gly Gin He Val Asn Thr Leu 
850 855 . 860 

Asp Lys Met He Arg Asn Pro Asn Ser Leu Lys Ala Met Ala Pro Leu 
865 870 875 880 

Ser Ser Gly He Asn Leu Pro Leu Leu Asp Arg Thr He Pro Asp Tyr 
885 890 895 
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Thr Ser Phe Asn Thr Val Asp Glu Trp Leu Glu Ala lie Lys Met Gly 
900 905 910 

Gin Tyr Lys Glu Ser Phe Ala Asn Ala Gly Phe Thr Ser Phe Asp Val 
915 920 925 

Val Ser Gin Met Met Met Glu Asp He Leu Arg Val Gly Val Thr Leu 
930 935 940 

i 

Ala Gly His Gin Lys Lys He Leu Asn Ser Il,e Gin Val Met Arg Ala 
945 950 955 960 

Gin Met Asn Gin He Gin Ser Val Glu Val 
965 970 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3162 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : /single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.. 2-97 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCA GCG TCC CTG GCC GGC TGC TAC TCT GCA CCT CGA CGG GCT CCC CTC 48 
Pro Ala Ser Leu Ala Gly Cys Tyr Ser Ala Pro Arg Arg Ala Pro Leu 
15 10 15 

TGG ACG TGC CTT CTC CTG TGC GCC GCA CTC CGG ACC CTC CTG GCC AGC 96 
Trp Thr Cys Leu Leu Leu Cys Ala Ala Leu Arg Thr Leu Leu Ala Ser 
20 25 30 

CCC AGC AAC GAA GTG AAT TTA TTG GAT TCA CGC ACT GTC ATG GGG GAC 144 
Pro Ser Asn Glu Val Asn Leu Leu Asp Ser Arg Thr Val Met Gly Asp 
35 40 45 

CTG GGA TGG ATT GCT TTT CCA AAA AAT GGG TGG GAA GAG ATT GGT GAA 192 
Leu Gly Trp He Ala Phe Pro Lys Asn Gly Trp Glu Glu He Gly Glu 
50 55 60 

GTG GAT GAA AAT TAT GCC CCT ATC CAC ACA TAC CAA GTA TGC AAA GTG 240 
Val Asp Glu Asn Tyr Ala Pro He His Thr Tyr Gin Val Cys Lys Val 
65 70 75 80 

ATG GAA CAG AAT CAG AAT AAC TGG CTT TTG ACC AGT TGG ATC TCC AAT 288 
Met Glu Gin Asn Gin Asn Asn Trp Leu Leu Thr Ser Trp He Ser Asn 
85 90 95 
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GAA GGT GCT TCC AGA ATC TTC ATA GAA CTC AAA TTT ACC CTG CGG GAC 336 
Glu Gly Ala Ser Arg lie Phe lie Glu Leu Lys Phe Thr Leu Arg Asp 
100 105 no 

TGC AAC AGC CTT CCT GGA GGA CTG GGG ACC TGT AAG GAA ACC TTT AAT 384 
Cys Asn Ser Leu Pro Gly Gly Leu Gly Th* Cys Lys Glu Thr Phe Asn 

115 , 120 125 1 

ATG TAT TAC TTT GAG TCA GAT GAT CAG AAT GGG AGA AAC ATC AAG GAA 432 
Met Tyr Tyr Phe Glu Ser Asp Asp Gin Asn Gly Arg Asn He Lys Glu 
130 135 140 

AAC CAA TAC ATC AAA ATT ' GAT ACC ATT GCT GCC- GAT GAA AGC TTT ACA 
Asn Gin Tyr He Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr 
145 , 150 155 



160 



480 



576 



GAA CTT GAT CTT GGT GAC CGT GTT ATG AAA CTG AAT ACA GAG GTC AGA 528 
Glu Leu Asp Leu Gly Asp Arg Val Met Lys Leu Asn Thr Glu Val Arg 
165 170 175 

GAT GTA GGA CCT CTA AGC AAA AAG GGA TTT TAT CTT GCT TTT CAA GAT 
Asp Val Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp 
18 <> 185 190 

GTT GGT GCT TGC ATT GCT CTG GTT TCT GTG CGT GTA TAC TAT AAA AAA 624 
Val Gly Ala Cys He Ala Leu Val Ser Val Arg Val Tyr Tyr Lvs Lva 
195 200 205 

TGC CCT TCT GTG GTA CGA CAC TTG GCT GTC TTC CCT GAC ACC ATC ACT 672 
Cys Pro Ser Val Val Arg His Leu Ala Val Phe, Pro Asp Thr He Thr 
210 215 1 220 

GGA GCT GAT TCT TCC CAA TTG CTC GAA GTG TCG GGC TCC TGT GTC AAC '720 
Gly Ala Asp Ser Ser Gin Leu Leu Glu Val Ser Gly Ser Cys Val Asn 
225 23 0 235 240 

CAT TCT GTG ACC GAT GAA CCT CCC AAA ATG CAC TGC AGC GCC GAA GGG 768 
Hxs Ser Val Thr Asp Glu Pro Pro Lys Met His Cys Ser Ala Glu Glv 
2 « 250 255 

GAG TGG CTG GTG CCC ATC GGG AAA TGC ATG TGC AAG GCA GGA TAT GAA 816 
Glu Trp Leu Val Pro He Gly Lys Cys Met Cys Lys Ala Gly Tyr Glu 
260 265 270 

GAG AAA AAT GGC ACC TGT CAA GTG TGC AGA CCT GGG TTC TTC AAA GCC 864 
Glu Lys Asn Gly Thr Cys Gin Val Cys Arg Pro Gly Phe Phe Lys Ala 
275 280 285 

TCA CCT CAC ATC CAG AGC TGC GGC AAA TGT CCA CCT CAC AGT TAT ACC 912 
Ser Pro His He Gin Ser Cys Gly Lys Cys Pro Pro His Ser Tvr Thr 
290 295 300 

CAT GAG GAA GCT TCA ACC TCT TGT GTC TGT GAA AAG GAT TAT TTC AGG 960 
His Glu Glu Ala Ser Thr Ser Cys Val Cys Glu Lys Asp Tyr Phe Arg 
305 310 315 320 
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AGA GAG TCT GAT CCA CCC ACA ATG GCA TGC ACA AGA CCC CCC TCT GCT 1008 
Arg Glu Ser Asp Pro Pro Thr Met Ala Cys Thr Arg Pro Pro Ser Ala 
325 330 335 

CCT CGG AAT GCC ATC TCA AAT GTT AAT GAA ACT AGT GTC TTT CTG GAA 1056 
Pro Arg Asn Ala lie Ser Asn Val Asn Glu Thr Ser yal Phe Leu Glu K 
340 ' 345 , 350 

TGG ATT CCG CCT GCT GAC ACT GGT GGA AGG AAA GAC GTG TCA TAT TAT 1104 
Trp lie Pro Pro Ala Asp Thr Gly Gly Arg Lys Asp Val Ser Tyr Tyr 
355 ( 360 365 

i 1 

ATT GCA TGC AAG AAG TGC AAC TCC CAT GCA GGT GTG TGT GAG GAG TGT 1152 
lie Ala Cys Lys, Lys Cys Asn Ser His Ala Gly Val Cys 'Glu Glu Cys 
370 375 380 

GGC GGT CAT GTC AGG TAC CTT CCC CGG CAA AGC GGC CTG AAA AAC ACC 1200 
Gly Gly His Val Arg Tyr Leu Pro Arg Gin Ser Gly Leu Lys Asn Thr 

385 . _ 390 395 y 400 ; 

TCT GTC ATG ATG GTG GAT CTA CTC GCT CAC ACA AAC TAT ACC TTT GAG 1248 
Ser Val Met Met Val Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu 
( 405 410 415 

ATT GAG GCA GTG AAT GGA GTG TCC GAC TTG AGC CCA GGA GCC CGG CAG 1296 
lie Glu Ala Val Asn Gly Val Ser Asp Leu Ser Pro Gly Ala Arg Gin 
420 425 430 

TAT GTG TCT GTA AAT GTA ACC ACA AAT CAA <5CA GCT CCA TCT CCA GTC 1344 
Tyr Val Ser Val Asn Val Thr Thr Asn Gin Ala Ala Pro Ser Pro Val 
435 440 445 

ACC AAT GTG AAA AAA GGG AAA ATT GCA AAA AAC AGC ATC TCT TTG TCT 1392 
Thr Asn Val Lys Lys Gly Lys lie Ala Lys Asn Ser lie Ser Leu Ser 
450 455 460 

TGG CAA GAA CCA GAT CGT CCC AAT GGA ATC ATC CTA GAG TAT GAA ATC 1440 
Trp Gin Glu Pro Asp Arg Pro Asn Gly lie lie Leu Glu Tyr Glu lie 
465 470 475 480 

AAG CAT TTT GAA AAG GAC CAA GAG ACC AGC TAC ACG ATT ATC AAA TCT 1488 
Lys His Phe Glu Lys Asp Gin Glu Thr Ser Tyr Thr lie lie Lys Ser 
485 490 495 

AAA GAG ACA ACT ATT ACT GCA GAG GGC TTG AAA CCA GCT TCA GTT TAT 1536 
Lys Glu Thr Thr lie Thr Ala Glu Gly Leu Lys Pro Ala Ser Val Tyr 
500 505 510 

GTC TTC CAA ATT CGA GCA CGT ACA GCA GCA GGC TAT GGT GTC TTC AGT 1584 
Val Phe Gin lie Arg Ala Arg Thr Ala Ala .Gly Tyr Gly Val Phe Ser 
515 520 525 

CGA AGA TTT GAG TTT GAA ACC ACC CCA GTG TTT GCA GCA TCC AGC GAT 1632 
Arg Arg Phe Glu Phe Glu Thr Thr Pro Val Phe Ala Ala Ser Ser Asp 
530 535 540 
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CAA AGC CAG ATT CCT GTA ATT GCT GTG TCT GTG ACA GTA GGA GTC ATT 1680 
Gin Ser Gin He Pro Val He Ala Val Ser Val Thr Val Gly Val He 
54 $ 550 555 560 

i 

TTG TTG GCA GTG GTT ATC GGC GTC CTC CTC AGT GGA AGG CGG TGT GGC 1728 
Leu Leu Ala Val Val He Gly Val Leu' Leu Ser Gly Arg Arg Cys Gly 
565 570 575 

TAC AGC AAA GCA AAA CAA GAT CCA GAA GAG GAA AAG ATG CAT TTT CAT 1776 
Tyr Ser Lys Ala Lys Gin Asp Pro Glu Glu Glu Lys Met His Phe His 
580 585 i 590 

AAT GGG CAC ATT AAA CTG CCA GGA GTA AGA ACT TAC ATT GAT CCA CAT* 1824 
Asn Gly His He Lys Leu Pro Gly Val Arg Thr Tyr He Asp Pro His 
595 600 605 

ACC TAT GAG GAT CCC AAT CAA GCT GTC CAC GAA TTT GCC AAG GAG ATA 1872 
Thr Tyr Glu Asp Pro Asn Gin Ala Val His Glu Phe Ala Lys Glu He 
610 615 620 

GAA GCA TCA TGT ATC ACC ATT GAG AGA GTT ATT GGA GCA GGT GAA TTT 1920 
Glu Ala Ser Cys He Thr He Glu Arg Val He Gly Ala Gly Glu Phe 
625 630 , 635 640 , 

GGT GAA GTT TGT AGT GGA CGT TTG AAA CTA CCA GGA AAA AGA GAA TTA 1968 
Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Lys Arg Glu Leu 
645 650 655 

CCT GTG GCT ATC AAA ACC CTT AAA GTA GGC TAT ACT GAA AAG CAA CGC 2016 
Pro Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gin Ara 
660 ' 665 670 

i 

AGA GAT TTC CTA GGT GAA GCA AGT ATC ATG GGA CAG TTT GAT CAT CCT 2064 
Arg Asp Phe Leu Gly Glu Ala Ser He Met Gly Gin Phe Asp His Pro 
675 680 685 

AAC ATC ATC CAT TTA GAA GGT GTG GTG ACC AAA AGT AAA CCA GTG ATG 2112 
Asn He He His Leu Glu Gly Val Val Thr Lys Ser Lys Pro Val Met 
690 695 700 

ATC GTG ACA GAG TAT ATG GAG AAT GGC TCT TTA GAT ACA TTT TTG AAG 2160 
He Val Thr Glu Tyr Met Glu Asn Gly Ser Leu Asp Thr Phe Leu Lys 
7 °5 710 715 720 

AAA AAC GAT GGG CAG TTC ACT GTG ATT CAG CTT GTT GGC ATG CTG AGA 2208 
Lys Asn Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Arg 
725 730 735 

GGT ATC TCT GCA GGA ATG AAG TAC CTT TCT GAC ATG GGC TAT GTG CAT 2256 
Gly He Ser Ala Gly Met Lys Tyr Leu Ser Asp Met Gly Tyr Val His 
740 745 750 

AGA GAT CTT GCT GCC AGA AAC ATC TTA ATC AAC AGT AAC CTT GTG TGC 2304 
Arg Asp Leu Ala Ala Arg Asn He Leu He Asn Ser Asn Leu Val Cys 
755 760 765 
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AAA GTG TCT GAC TTT GGA CTT TCC CGG GTA CTG 
Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu 
770 775 



GAA GAT GAT CCC GAG 
Glu Asp Asp Pro Glu 
780 



2352 



GCA GCC TAC ACC ACA AGG GGA GGA AAA ATT CCA 
Ala Ala Tyr Thr Thr Arg Gly Gly Lys He Pro 
785 790 795 



ATC AGA TGG ACT GCC 
lie Arg Trp Thr Ala 
800 



2400 



CCA GAA' GCA ATA GCT TTC CGA AAG TTT ACT TCT 
Pro Glu Ala He Ala Phe Arg Lys Phe Thr Se,r 
805 810 



GCC AGT GAT GTC TGG 
Ala Ser Asp Val Trp 
815 



2448 



AGT TAT GGA ATA GTA ATG TGG GAA GTT GTG TCT 
Ser Tyr Gly He Val Met Trp Glu Val Val Ser 
820 825 



-TAT GGA GAG AGA CCC 
Tyr Gly Glu Arg Pro 
830 



2496 



TAC TGG GAG ATG ACC AAT CAA GAT GTG ATT AAA 
Tyr Trp Glu Met Thr Asn Gin Asp Val He Lys 
._ .835 840 



GCG GTA GAG J3AA GGC 
Ala Val Glu Glu Gly 
845 



2544 



TAT CGT CTG CCA AGC CCC ATG GAT TGT CCT GCT 
Tyr Arg Leu Pro Ser Pro Met /Asp Cys Pro Ala 
850 855 



GCT CTC TAT CAG TTA 
Ala Leu Tyr Gin Leu 
860 



2592 



ATG CTG GAT TGC TGG CAG AAA GAG CGA AAT AGC 
Met Leu Asp Cys Trp Gin Lys Glu Arg Asn Ser 
865 870 875 



AGG CCC AAG TTT GAT 
Arg Pro Lys Phe Asp 
880 



2640 



GAA ATA GTC AAC ATG TTG GAC AAG CTG ATA CGT 
Glu He Val Asn Met Leu Asp tys Leu He Arg 
885 890 



AAC CCA AGT AGT CTG 
Asn Pro Ser Ser Leu 
895 



2688 



AAG ACG CTG GTT AAT GCA TCC TGC AGA GTA TCT 
Lys Thr Leu Val Asn Ala Ser Cys Arg Val Ser 
900 905 



AAT TTA TTG GCA GAA 
Asn Leu Leu Ala Glu 
910 



2736 



CAT AGC CCA CTA GGA TCT GGG GCC TAC AGA TCA 
His Ser Pro Leu Gly Ser Gly Ala Tyr Arg Ser 
915 920 



GTA GGT GAA TGG CTA 
Val Gly Glu Trp Leu 
925 



2784 



GAG GCA ATC AAG ATG GGC CGG TAT ACA GAG ATT 
Glu Ala He Lys Met Gly Arg Tyr Thr Glu He 
930 935 



TTC ATG GAA AAT GGA 
Phe Met Glu Asn Gly 
940 



2832 



TAC AGT TCA ATG GAC GCT GTG GCT CAG GTG ACC 
Tyr Ser Ser Met Asp Ala Val Ala Gin Val Thr 
945 950 955 



TTG GAG GAT TTG AGA 
Leu Glu Asp Leu Arg 
960 



2880 



CGG CTT GGA GTG ACT CTT GTC GGT CAC CAG AAG 
Arg Leu Gly Val Thr Leu Val Gly His Gin Lys 
965 970 



AAG ATC ATG AAC AGC 
Lys He Met Asn Ser 
975 



2928 



CTT CAA GAA ATG AAG GTG CAG CTG GTA AAC GGA 
2983 

Leu Gin Glu Met Lys Val Gin Leu Val Asn Gly 
980 985 



ATG GTG CCA TTG TAACTTCATG 

Met Val Pro Leu 
990 



TAAATGTCGC TTCTTCAAGT GAATGATTCT GCACTTTGTA AACAGCACTG AGATTTATTT 



3043 
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TAACAAAAAA AGGGGGAAAA GGGAAAACAG TGATTTCTAA ACCTTAGAAA ACATTTGCCT 3103 
CAGCCACAGA ATTTGTAATC ATGGTTTTAC TGAAGTATCC AGTTCTTAGT CCTTAGTCT 3162 

(2) INFORMATION FOR SEQ ID NO: 13: ^ * 

i 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 amino acids 

(B) TYPE : t amino acid 
. (D) TOPOLOGY: linear 

t i 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Ala Ser Leu Ala Gly' Cys Tyr Ser Ala Pro Arg Arg Ala Pro Leu 
1 5 10 is 

Trp Thr Cys Leu Leu Leu Cys Ala Ala Leu Arg Thr Leu Leu Ala Ser 1 
20 25 30 

Pro Ser Asn Glu Val Asn Leu Leu Asp Ser Arg Thr Val Met Gly Asp 
35 40 45 

Leu Gly Trp He Ala Phe Pro Lys Asn Gly Trp Glu Glu He Gly Glu 
50 55 60 

Val Asp Glu Asn Tyr Ala Pro He His Thr Tyr, Gin Val Cys Lys Val 
65 70 7$ 80 

Met Glu Gin Asn Gin Asn Asn Trp Leu Leu Thr Ser Trp He Ser Asn ' 
85 90 95 

Glu Gly Ala Ser Arg He Phe He Glu Leu Lys Phe Thr Leu Arg Asp 
100 105 HO 

Cys Asn Ser Leu Pro Gly Gly Leu Gly Thr Cys Lys Glu Thr Phe Asn 
115 120 125 

Met Tyr Tyr Phe Glu Ser Asp Asp Gin Asn Gly Arg Asn He Lys Glu 
130 135 140 

Asn Gin Tyr He Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr 
145 150 155 160 

Glu Leu Asp Leu Gly Asp Arg Val Met Lys Leu Asn Thr Glu Val Arg 
165 170 175 

Asp Val Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp 
180 185 190 

Val Gly Ala Cys He Ala Leu Val Ser Val Arg Val Tyr Tyr Lys Lys 
195 200 205 

Cys Pro Ser Val Val Arg His Leu Ala Val Phe Pro Asp Thr He Thr 
210 215 220 
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Glv Ala Asp Ser Ser Gin Leu Leu Glu Val Ser Gly Ser Cys Val Asn 
225 230 ( 235 , 240 

His Ser Val Thr Asp Glu Pro Pro Lys Met His Cys Ser Ala Glu Gly 
245 250 255 

Glu Trp Leu Val Pro He Gly Lys Cys Met Cys Lys Ala Gly Tyr Glu 
260 265 , 270 

Glu Lys Asn Gly Thr Cys Gin Val Cys Arg Pro Gly Phe Phe Lys Ala 
275 280 285 

Ser Pro His He Gin Ser Cys Gly Lys Cys Pro Pro His Ser Tyr Thr 
290 295 300 

His Glu Glu Ala Ser Thr Ser Cys Val Cys Glu Lys Asp Tyr Phe Arg 
305 310 315 320 

Arg Glu Ser Asp Pro Pro Thr Met Ala Cys Thr Arg Pro Pro Ser Ala 
325 330 335 

Pro Arg Asn Ala lie Ser Asn Val Asn Glu Thr Ser Val Phe Leu Glu 
340 345 350 

Trp He Pro Pro Ala Asp Thr Gly Gly Arg Lys Asp Val Ser Tyr Tyr 
355 360 365 

He Ala Cys Lys Lys Cys Asn Ser His Ala Gly Val Cys Glu Glu Cys 
370 375 i 380 

Gly Gly His Val Arg Tyr Leu Pro Arg Gin Ser Gly Leu Lys Asn Thr 
385 390 395 400 

Ser Val Met Met Val Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu 
405 410 415 

He Glu Ala Val Asn Gly Val Ser Asp Leu Ser Pro Gly Ala Arg Gin 
420 425 430 

Tyr Val Ser Val Asn Val Thr Thr Asn Gin Ala Ala Pro Ser Pro Val 
435 440 445 

Thr Asn Val Lys Lys Gly Lys He Ala Lys Asn Ser He Ser Leu Ser 
450 455 460 

Trp Gin Glu Pro Asp Arg Pro Asn Gly He He Leu Glu Tyr Glu He 
465 470 475 480 

Lys His Phe Glu Lys Asp Gin Glu Thr Ser Tyr Thr He He Lys Ser 
485 490 495 

Lys Glu Thr Thr He Thr Ala Glu Gly Leu Lys Pro Ala Ser Val Tyr 
500 505 510 

Val Phe Gin He Arg Ala Arg Thr Ala Ala Gly Tyr Gly Val Phe Ser 
515 520 525 
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Arg Arg Phe Glu Phe Glu Thr Thr Pro Val Phe Ala Ala Ser Ser Asp 
530 535 540 

Gin Ser Gin lie Pro Val He Ala Val Ser Val Thr Val Gly Val He 
545 550 , 555 560 

Leu Leu Ala Val VaJ. He Gly Val Leu Leu' Ser Gly Arg Arg Cys Glv 
565 570 , 575 

Tyr Ser Lys Ala Lys Gin Asp Pro Glu Glu Glu Lys Met His Phe His 
580 , 585 590 

Asn Gly His lie Lys Leu Pro Gly Val Arg, Thr Tyr He Asp Pro His 
600 605 



Thr Tyr Glu Asp Pro Asn Gin Ala Val His Glu Phe Ala Lys Glu He 
610 , 615 620 

Glu Ala Ser Cys He Thr He Glu Arg' Val He Gly Ala Gly Glu Phe 
625 630 635 , 640 

Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Lys Arg Glu Leu 
645 650 ' 655 

Pro Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gin Arc 
660 665 670 

Arg Asp Phe Leu Gly Glu Ala Ser He Met Gly Gin Phe Asp His Pro 
675 680 685 

ASn lit 116 H±S Leu Glu Gly Val Val Thr Lys' Ser Lys Pro Val Met 
65,0 695 700 

He Val Thr Glu Tyr Met Glu Asn Gly Ser Leu Asp Thr Phe Leu Lys 

710 715 720 

Lys Asn Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Aro 
725 730 735 

Gly He Ser Ala Gly Met Lys Tyr Leu Ser Asp Met Gly Tyr Val His 
740 745 75 0 

Arg Asp Leu Ala Ala Arg Asn He Leu He Asn Ser Asn Leu Val Cys 
755 760 765 

Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu Glu Asp Asp Pro Glu 
110 775 780 

Ala Ala Tyr Thr Thr Arg Gly Gly Lys He Pro He Arg Trp Thr Ala 
785 790 795 eoo 

Pro Glu Ala He Ala Phe Arg Lys Phe Thr Ser Ala Ser Asp Val Tro 
805 810 815 ^ 

Ser Tyr Gly He Val Met Trp Glu Val Val Ser Tyr Gly Glu Arg Pro 
820 8 25 8 3o 
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Tyr Trp Glu Met Thr Asn Gin Asp Val lie Lys Ala Val Glu Glu Gly 
835 840 845 

Tyr Arg Leu Pro Ser Pro, Met Asp Cys Pro Ala Ala Leu Tyr Gin Leu 
850 855 , 860 , 

Met Leu Asp Cys Trp Gin Lys.Glu Arg Asn Ser ,Arg Pro Lys Phe Asp 
865 870 875, 880 

Glu lie Val Asn Met Leu Asp Lys Leu He Arg Asn Pro Ser Ser Leu 
885 890 895 

, f 

Lys Thr Leu Val Asn Ala Ser Cys Arg Val Ser Asn Leu Leu Ala Glu 
900 , 905 910 

His Ser Pro Leu Gly Ser Gly Ala Tyr Arg Ser Val Gly Glu Trp Leu 
915 ' 920 , 925 

Glu Ala He Lys Met Gly Arg Tyr Thr Glu He Phe Met Glu Asn Gly . 
930 935 940 

Tyr Ser Ser Met Asp Ala Val Ala Gin Val Thr Leu Glu Asp Leu Arg 
945 , 950 955 960 

Arg Leu Gly Val Thr Leu Val Gly His Gin Lys Lys He Met Asn Ser 
965 970 975 

Leu Gin Glu Met Lys Val Gin Leu Val Asn Gly Met Val Pro Leu 
980 985 ( , 990 

! 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 34.. 2994 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AAGCGGCAGG AGCAGCGTTG GCACCGGCGA ACC ATG GCT GGG ATT TTC TAT TTC 54 

Met Ala Gly He Phe Tyr Phe 
1 5 

GCC CTA TTT TCG TGT CTC TTC GGG ATT TGC GAC GCT GTC ACA GGT TCC 102 
Ala Leu Phe Ser Cys Leu Phe Gly He Cys Asp Ala Val Thr Gly Ser 
10 15 20 
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AGG GTA TAC CCC GCG AAT GAA GTT ACC TTA TTG GAT TCC AGA TCT GTT iso 
Arg Val Tyr Pro Ala Asn Glu Val Thr Leu Leu Asp Ser Arg Ser Val 
25 30 35 

CAG GGA GAA CTT GGG TGG ATA GCA AGC CCT CTG GAA GGA GGG TGG GAG lQfi 
Gin Gly Glu Leu Gly Trp lie Ala Ser Pro Leu Glu Gly Gly Trp Glu 
40 . « 50 55 

GAA GTG AGT ATC ATG GAT GAA AAA AAT ACA CCA ATC CGA ACC TAC CAA 246 
Glu Val Ser He Met Asp Glu Lys Asn Thr Pro He Arg Thr Tyr Gin 
60 65 , 70 

GTG TGC AAT GTG ATG GAA CCC AGC CAG AAT AAC TGG CTA CGA ACT GAT ' 294 
Val Cys Asn Val Met Glu Pro Ser Gin Asn Asn Trp Leu Arg Thr Asp 
75 80 85 

TGG ATC ACC CGA GAA GGG GCT CAG AGG GTG TAT ATT GAG ATT AAA TTC 342 
Trp He Thr Arg Glu Gly Ala Gin Arg Val Tyr He Glu He Lys Phe 
90 95 ioo ' 

ACC TTG AGG GAC TGC AAT AGT CTT CCG GGC GTC ATG GGG ACT TGC AAG 390 
Thr Leu Arg Asp Cys Asn Ser Leu Pro Gly Val Met Gly Thr Cys £s 

GAG ACG TTT AAC CTG TAC TAC TAT GAA TCA GAC AAC GAC AAA GAG CGT 438 
Glu Thr Phe Asn Leu Tyr Tyr Tyr Glu Ser Asp Asn Asp Lys Glu Arg 
120 125 130 13 f 

5£ t? C ^ G ^ ^ TTT GTC ATT GAC ACC ATT GCT GCT GAT 486 

Phe He Arg Glu Asn Gin Phe Val Lys He Asp Thr He Ala Ala Asp 

140 145 150 



GAG AGC TTC ACC CAA GTG GAC ATT GGT GAC AGA ATC ATG AAG CTG AAC 
Glu Ser Phe Thr Gin Val Asp He Gly Asp Arg He Met Lys 2S j£n 
155 160 i 65 

ACC GAG ATC CGG GAT GTA GGG CCA TTA AGC AAA AAG GGG TTT TAC CTG 5fl? 
Thr Glu lie Arg Asp Val Gly Pro Leu Ser Lys Lys Gly Phe Tyr 
170 175 iso 

GCT TTT CAG GAT GTG GGG GCC TGC ATC GCC CTG GTA TCA GTC CGT GTG 630 
Ala Phe Gin Asp Val Gly Ala Cys He Ala Leu Val Ser Val Arg Val 
185 190 195 



TTC TAT AAA AAG TGT CCA CTC ACA GTC CGC AAT CTG GCC CAG TTT CCT 678 
Phe Tyr Lys Lys Cys Pro Leu Thr Val Arg Asn Leu Ala Gin Phe Pro 
* uu 205 210 215 

GAC ACC ATC ACA GGG GCT GAT ACG TCT TCC CTG GTG GAA GTT CGA GGC not 
Asp Thr He Thr Gly Ala Asp Thr Ser Ser Leu Val Glu vll 2£ S 
22 0 225 230 

TCC TGT GTC AAC AAC TCA GAA GAG AAA GAT GTG CCA AAA ATG TAC TGT 774 
Ser Cys Val Asn Asn Ser Glu Glu Lys Asp Val Pro Lys Met Tyr Cys 
235 240 245 
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GGG GCA GAT GGT GAA TGG CTG GTA CCC ATT GGC AAC TGC CTA TGC AAC 822 

Gly Ala Asp Gly Glu Trp Leu Val Pro lie Gly Asn Cys Leu Cys Asn 
250 255 , • 260 

GCT GGG CAT GAG GAG CGG AGC GGA GAA TGC CAA GCT TGC AAA ATT GGA 870 

Ala Gly His Glu Glu Arg Ser Gly Glu Cys Gin Ala Cys Lys lie Gly 
265 270 275 

TAT TAC AAG GCT CTC TCC ACG GAT GCC ACC TGT GCC AAG TGC CCA CCC 918 

Tyr Tyr Lys Ala Leu Ser Thr Asp Ala Thr Cys Ala Lys Cys Pro Pro 

280 285 290 295 

CAC AGC TAC TCT GTC TGG GAA GGA GCC ACC TCG TGC ACC TGT GAC CGA 966 

His— Ser-Tyr-Ser-^Va-l~-Trp -Giu-Gly-Ala— Thr SerCys~Thr- Cys-Asp Arg 

300 305 310 



GGC TTT TTC AGA GCT GAC AAC GAT GCT GCC TCT ATG CCC TGC ( ACC CGT 
Gly Phe Phe Arg Ala Asp Asn Asp Ala Ala Ser Met Pro Cys Thr Arg 
315 320 325 



1014 



CCA CCA TCT GCT CCC CTG AAC TTG ATT TCA AAT GTC AAC GAG ACA TCT 1062 
Pro Pro Ser Ala Pro Leu Asn / Leu lie Ser Asn Val Asn Glu Thr Ser • t 
330 335 340 

GTG AAC TTG GAA TGG AGT AGC CCT CAG AAT ACA GGT GGC CGC CAG GAC 1110 
Val Asn Leu Glu Trp Ser Ser Pro Gin Asn Thr Gly Gly Arg Gin Asp 
345 350 355 



ATT TCC TAT AAT GTG GTA TGC AAG AAA TGT GGA GCT GGT GAC CCC AGC 
lie Ser Tyr Asn Val Val Cys ! Lys Lys Cys Gly Ala Gly Asp Pro Ser 
360 365 370 ' 375 



1158 



AAG TGC CGA CCC TGT GGA AGT GGG GTC CAC TAC ACC CCA CAG CAG AAT 
Lys Cys Arg Pro Cys Gly Ser Gly Val His Tyr Thr Pro Gin Gin Asn 
380 385 390 



1206 



GGC TTG AAG ACC ACC AAA GTC TCC ATC ACT GAC CTC CTA GCT CAT ACC 
Gly Leu Lys Thr Thr Lys Val Ser lie Thr Asp Leu Leu Ala His Thr 
395 400 405 



1254 



AAT TAC ACC TTT GAA ATC TGG GCT GTG AAT GGA GTG TCC AAA TAT AAC 1302 
Asn Tyr Thr Phe Glu lie Trp Ala Val Asn Gly Val Ser Lys Tyr Asn 
410 415 420 

CCT AAC CCA GAC CAA TCA GTT TCT GTC ACT GTG ACC ACC AAC CAA GCA 1350 
Pro Asn Pro Asp Gin Ser Val Ser Val Thr Val Thr Thr Asn Gin Ala 
425 430 435 



GCA CCA TCA TCC ATT GCT TTG GTC CAG GCT AAA GAA GTC ACA AGA TAC 1398 
Ala Pro Ser Ser lie Ala Leu Val Gin Ala Lys Glu Val Thr Arg Tyr 
440 445 450 455 

AGT GTG GCA CTG GCT TGG CTG GAA CCA GAT CGG CCC AAT GGG GTA ATC 1446 
Ser Val Ala Leu Ala Trp Leu Glu Pro Asp Arg Pro Asn Gly Val lie 
460 465 470 
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CTG GAA TAT GAA GTC AAG TAT TAT GAG AAG GAT CAG AAT GAG CGA AGC 1494 
Leu Glu Tyr Glu Val Lys Tyr Tyr Glu Lys Asp Gin Asn Glu Arg Ser 
475 480 485 

TAT CGT ATA GTT CGG ACA iGCT GCC AGG AAC ACA GAT ATC AAA GGC CTG ' 1542 

Tyr Arg lie Val Arg Thr Ala Ala Arg Asn Thr Asp lie Lys Gly Leu 

490 . , 495 500 4 

i 

AAC CCT CTC ACT TCC TAT GTT TTC CAC GTG CGA, GCC AGG ACA GCA GCT 1590 
Asn Pro Leu Thr Ser Tyr Val Phe His Val Arg Ala Arg Thr Ala Ala 
S05 , 510 515 

GGC TAT GGA GAC TTC AGT GAG CCC TTG GAG GTT ACA ACC AAC ACA GTG 1638 
Gly Tyr Gly Asp Phe Ser Glu Pro Leu Glu Val Thr Thr Asn Thr Val 
520 , 525 .530 535 

CCT TCC CGG ATC ATT GGA GAT GGG GCT AAC TCC ACA GTC CTT CTG GTC 1686 
Pro Ser Arg He He Gly Asp Gly Ala Asn Ser Thr Val Leu Leu Val 
540 1 545 550 

TCT GTC TCG GGC AGT GTG GTG CTG GTG GTA ATT CTC ATT GCA GCT TTT 1734 
Ser Val Ser Gly Ser Val Val Leu Val Val He Leu He Ala Ala Phe 
555 560 ' 565 

GTC ATC AGC CGG AGA CGG AGT AAA TAC AGT AAA GCC ' AAA CAA GAA GCG 1782 
Val He Ser Arg Arg Arg Ser Lys Tyr Ser Lys Ala Lys Gin Glu Ala 
570 575 580 

GAT GAA GAG AAA CAT TTG AAT CAA GGT GTA AGA ACA TAT GTG GAC CCC 1830 
Asp Glu Glu Lys His Leu Asn Gin Gly Val Arg t Thr Tyr Val Asp Pro 
585 590 1 595 

TTT ACG TAC GAA GAT CCC AAC CAA GCA GTG CGA GAG TTT GCC AAA GAA 1878 
Phe Thr Tyr Glu Asp Pro Asn Gin Ala Val Arg Glu Phe Ala Lys Glu 
600 605 610 615 

ATT GAC GCA TCC TGC ATT AAG ATT GAA AAA GTT ATA GGA GTT GGT GAA 1926 
He Asp Ala Ser Cys He Lys He Glu Lys Val He Gly Val Gly Glu 
«0 625 630 

TTT GGT GAG GTA TGC AGT GGG CGT CTC AAA GTG CCT GGC AAG AGA GAG 1974 
Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Val Pro Gly Lys Arg Glu 
635 640 645 

ATC TGT GTG GCT ATC AAG ACT CTG AAA GCT GGT TAT ACA GAC AAA CAG 2022 
He Cys Val Ala He Lys Thr Leu Lys Ala Gly Tyr Thr Asp Lys Gin 
650 655 660 

AGG AGA GAC TTC CTG AGT GAG GCC AGC ATC ATG GGA CAG TTT GAC CAT 2070 
Arg Arg Asp Phe Leu Ser Glu Ala Ser He Met Gly Gin Phe Asp His 
665 670 675 

CCG AAC ATC ATT CAC TTG GAA GGC GTG GTC ACT AAA TGT AAA CCA GTA 2118 
Pro Asn He He His Leu Glu Gly Val Val Thr Lys Cys Lys Pro Val 
680 685 690 695 
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ATG ATC ATA ACA GAG TAC ATG GAG AAT GGC TCC TTG GAT GCA TTC CTC 2166 
Met He He Thr Glu Tyr Met Glu Asn Gly Ser Leu Asp Ala Phe Leu 
700 705 710 

AGG AAA AAT GAT GGC AGATTT ACA GTC AT£ CAG CTG GTG GGC ATG CTT ^ 2214 
Arg Lys Asn Asp Gly Arg Phe Thr Val He Gin Leu Val Gly Met Leu 
715 ' 720 , 725 

CGT GGC ATT GGG TCT GGG ATG AAG TAT TTA TCT GAT ATG AGC TAT GTG 2262 
Arg Gly He Gly Ser Gly Met Lys Tyr Leu Ser Asp Met Ser Tyr Val 
730 735 740 

1 . i 

CAT CGT GAT CTG GCC GCA CGG AAC ATC CTG GTG AAC AGC AAC TTG GTC 2310 
His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val 
745 750 755 

TGC AAA GTG TCT GAT TTT 1 GGC ATG TCC CGA GTG CTT GAG GAT GAT CCG 2358 
Cys Lys Val Ser Asp Phe Gly Met Ser Arg Val Leu Glu Asp Asp Pro 

,7.60 . 765 770 .. 775 . 

GAA GCA GCT TAC ACC ACC AGG GGT GGC AAG ATT CCT ATC CGG TGG ACT 2406 
Glu Ala Ala Tyr Thr Thr Arg Gly Gly Lys lie Pro He Arg Trp Thr 
( 780 785 790 

GCG CCA GAA GCA ATT GCC TAT CGT AAA TTC ACA TCA GCA AGT GAT GTA 2454 
Ala Pro Glu Ala He Ala Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val 
795 800 805 

TGG AGC TAT GGA ATC GTT ATG TGG GAA GTG ATG, TCG TAC GGG GAG AGG 2502 
Trp Ser Tyr Gly He Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg 

810 815 820 , 

CCC TAT TGG GAT ATG TCC AAT CAA GAT GTG ATT AAA GCC ATT GAG GAA 2550 
Pro Tyr Trp Asp Met Ser Asn Gin Asp Val He Lys Ala He Glu Glu 
825 830 835 

GGC TAT CGG TTA CCC CCT CCA ATG GAC TGC CCC ATT GCG CTC CAC CAG 2598 
Gly Tyr Arg Leu Pro Pro Pro Met Asp Cys Pro He Ala Leu His Gin 
840 845 850 855 

CTG ATG CTA GAC TGC TGG CAG AAG GAG AGG AGC GAC AGG CCT AAA TTT 2646 
Leu Met Leu Asp Cys Trp Gin Lys Glu Arg Ser Asp Arg Pro Lys Phe 
860 865 870 

GGG CAG ATT GTC AAC ATG TTG GAC AAA CTC ATC CGC AAC CCC AAC AGC 2694 
Gly Gin He Val Asn Met Leu Asp Lys Leu He Arg Asn Pro Asn Ser 
875 880 885 

TTG AAG AGG ACA GGG ACG GAG AGC TCC AGA CCT AAC ACT GCC TTG TTG 2742 
Leu Lys Arg Thr Gly Thr Glu Ser Ser Arg Pro Asn Thr Ala Leu Leu 
890 895 900 

GAT CCA AGC TCC CCT GAA TTC TCT GCT GTG GTA TCA GTG GGC GAT TGG 2790 
Asp Pro Ser Ser Pro Glu Phe Ser Ala Val Val Ser Val Gly Asp Trp 
90S 910 915 
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CTC CAG GCC ATT AAA ATG GAC CGG TAT AAG GAT AAC TTC ACA GCT GCT 2838 
Leu Gin Ala He Lys Met Asp Arg Tyr Lys Asp Asn Phe Thr Ala Ala 
920 925 930 935 



i 



GGT TAT ACC ACA CTA GAG GCT GTG GTG CAC GTG AAC CAG GAG GAC CTG 2886 
Gly Tyr Thr Thr Leu Glu Ala Val Val His Val Asn Gin Glu Asp Leu 
940 945 95Q 

GCA AGA ATT GGT ATC ACA GCC ATC ACG CAC CAG AAT AAG ATT TTG AGC 2934 
Ala Arg He Gly He Thr Ala He Thr His Gin Asn Lya He Leu Ser 
955 960 , 965 

t 

AGT GTC CAG GCA ATG CGA ACC CAA ATG CAG CAG ATG CAC GGC AGA ATG 2982 
Ser Val Gin Ala Met Arg Thr Gin Met Gin Gin Met His Gly Arg Met 
* 7 ° 975 980 

GTT CCC GTC TGAGCCAGTA CTGAATAAAC TCAAAACTCT TGAAATTAGT 3<m 
Val Pro Val 

985 • 

TTACCTCATC CATGCACTTT AATTGAAGAA CTGCACTTTT TTTACTTCGT CTTCGCCCTC 3091 
TGAAATTAAA GAAATGAAAA AAAAA , 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 986 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: liinear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Ala Gly He Phe Tyr Phe Ala Leu Phe Ser Cys Leu Phe Gly He 
1 5 io 



15 



Cys Asp Ala Val Thr Gly Ser Arg Val Tyr Pro Ala Asn Glu Val Thr 
20 25 30 

Leu Leu Asp Ser Arg Ser Val Gin Gly Glu Leu Gly Trp He Ala Ser 
35 40 45 

Pro Leu Glu Gly Gly Trp Glu Glu Val Ser He Met Asp Glu Lys Asn 
50 55 60 

Thr Pro He Arg Thr Tyr Gin Val Cys Asn Val Met Glu Pro Ser Gin 
65 70 75 80 

Asn Asn Trp Leu Arg Thr Asp Trp He Thr Arg Glu Gly Ala Gin Aro 
85 90 95 

Val Tyr He Glu He Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu Pro 
!00 105 no 
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Gly Val Met Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr Glu 
115 120 125 

Ser Asp Asn Asp Lys Glu f Arg Phe lie Arg Glu Asn Gin Phe Val Lys 
130 ' 135 ( 140 

lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr ,Gln Val Asp He Gly 
145 150 155 160 

Asp Arg lie Met Lys Leu Asn Thr Glu He Arg Asp Val Gly Pro Leu 
165 ' 170 175 

I 

Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys He 
180 , 185 190 

Ala Leu Val Ser Val Arg Val Phe Tyr Lys Lys Cys Pro Leu Thr Val 

195 ' 200 205 

i 

Arg Asn Leu ,Ala Gin Phe Pro Asp Thr He Thr Gly Ala ^sp Thr Ser 
210 215 220 

Ser Leu Val Glu Val Arg Gly Ser Cys Val Asri Asn Ser Glu Glu Lys 
225 ( 230 235 240 

Asp Val Pro Lys Met Tyr Cys Gly Ala Asp Gly Glu Trp Leu Val Pro 
245 250 255 

He Gly Asn Cys Leu Cys Asn Ala Gly His Glu Glu Arg Ser Gly Glu 

260 265 ' , 270 

i 

Cys Gin Ala Cys Lys He Gly Tyr Tyr Lys Ala Leu Ser Thr Asp Ala 
275 280 285 

Thr Cys Ala Lys Cys Pro Pro His Ser Tyr Ser Val Trp Glu Gly Ala 
290 295 300 

Thr Ser Cys Thr Cys Asp Arg Gly Phe Phe Arg Ala Asp Asn Asp Ala 
305 310 315 320 

Ala Ser Met Pro Cys Thr Arg Pro Pro Ser Ala Pro Leu Asn Leu He 
325 330 335 

Ser Asn Val Asn Glu Thr Ser Val Asn Leu Glu Trp Ser Ser Pro Gin 
340 345 350 

Asn Thr Gly Gly Arg Gin Asp He Ser Tyr Asn Val Val Cys Lys Lys 
355 360 365 

Cys Gly Ala Gly Asp Pro Ser Lys Cys Arg Pro Cys Gly Ser Gly Val 
370 375 380 

His Tyr Thr Pro Gin Gin Asn Gly Leu Lys Thr Thr Lys Val Ser He 
385 390 395 400 

Thr Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu He Trp Ala Val 
405 410 415 
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Asn Gly Val Ser Lys Tyr Asn Pro Asn Pro Asp Gin Ser Val Ser Vai 
420 425 430 

Thr Val Thr Thr Asn Gin Ala Ala Pro Ser Ser lie Ala Leu Val Gin 
435 440 445 

l 

Ala Lys Glu Val Thr Arg Tyr Ser Val Ala Leu Ala Trp Leu Glu Pro 
450 455 460 

Asp Arg Pro Asn Gly Val He Leu Glu Tyr Glu Val Lys Tvr Tvr 
465 470 



Glu 

47|5 480 



Lys Asp Gin Asn Glu Arg Ser Tyr Arg He Val Arg Thr Ala Ala Ara ' 
485 490 495 



Asn Thr Asp He Lys Gly Leu Asn Pro Leu Thr Ser Tyr Val Phe His 
500 505 5io 

Val Arg Ala Arg Thr Ala Ala Gly Tyr Gly Asp Phe Ser Glu Pro Leu 
515 520 525 

Glu val Thr Thr Asn Thr Val Pro Ser Arg He He Gly Asp Gly Ala 

535 / 540 



Asn Ser Thr Val Leu Leu Val Ser Val Ser Gly Ser Val val Leu Val 

560 



545 550 555 



Val He Leu He Ala Ala Phe Val He Ser Arg Arg Arg Ser Lys Tyr 
565 570 575 

Ser Lys Ala Lys Gin Glu Ala Asp Glu Glu Lys His Leu Asn Gin Glv 
580 585 590 

Val Arg Thr Tyr Val Asp Pro Phe Thr Tyr Glu Asp Pro Asn Gin Ala 
595 600 605 

Val Arg Glu Phe Ala Lys Glu He Asp Ala Ser Cys He Lys He Glu 
610 615 620 

Lys Val He Gly Val Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu 
625 630 635 * 640 

Lys Val Pro Gly Lys Arg Glu He Cys Val Ala He Lys Thr Leu Lvs 
645 650 655 

Ala Gly Tyr Thr Asp Lys Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser 
660 665 670 

He Met Gly Gin Phe Asp His Pro Asn He He His Leu Glu Gly Val 
675 680 685 

Val Thr Lys Cys Lys Pro Val Met He He Thr Glu Tyr Met Glu Asn 
690 695 700 

Gly Ser Leu Asp Ala Phe Leu Arg Lys Asn Asp Gly Arg Phe Thr Val 
705 710 7 15 720 
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He Gin I»eu Val Gly Met Leu Arg Gly He Gly Ser Gly Met Lys Tyr 
725 730 735 

Leu Ser Asp Met Ser Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 
740 745 . 750 

Leu Val Asn Ser Ash Leu Val Cys Lys Val Ser Asp Phe Gly Met Ser 
755 760 765 

i 

Arg Val Leu Glu Asp Asp Pro Glu Ala Ala Tyx Thr Thr Arg Gly Gly 
770 775 780 

Lys He Pro He Arg Trp Thr Ala Pro Glu Ala He Ala Tyr Arg Lys 
785 ~ 790 795 800 

Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu 
805 810 815 

Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp 
820 825 830 

Val He Lys Ala lie Glu Glu jGly Tyr Arg Leu Pro Pro Pro Met Asp 
835 840 845 

Cys Pro He Ala Leu His Gin Leu Met Leu Asp Cys Trp Gin Lys Glu 
850 855 860 

Arg Ser Asp Arg Pro Lys Phe Gly Gin He Val Asn Met Leu Asp Lys 
865 870 875 880 

! 

Leu He Arg Asn Pro Asn Ser Leu Lys Arg Thr Gly Thr Glu Ser Ser 
885 890 895 

Arg Pro Asn Thr Ala Leu Leu Asp Pro Ser Ser Pro Glu Phe Ser Ala 
900 905 910 

Val Val Ser Val Gly Asp Trp Leu Gin Ala He Lys Met Asp Arg Tyr 
915 920 925 

Lys Asp Asn Phe Thr Ala Ala Gly Tyr Thr Thr Leu Glu Ala Val Val 
930 935 940 

His Val Asn Gin Glu Asp Leu Ala Arg He Gly He Thr Ala He Thr 
945 950 955 960 

His Gin Asn Lys He Leu Ser Ser Val Gin Ala Met Arg Thr Gin Met 
965 970 975 

Gin Gin Met His Gly Arg Met Val Pro Val 
980 985 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ; TOPPLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: 1 CDS , 

(B) LOCATION: 186.. 3182 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

i 

CGGTGCGAGC GAACAGGAGT GGGGGGGAAA TTAAAAAAAG CTAAACGTGG AGCAGCCGAT 60* 

CGGGGACCGA GAAGGGGAAT CGATGCAAGG AGCACACTAA AACAAAAGCT ACTTCGGAAC 120 

AAACAGCATT TAAAAATCCA CGACTCAAGA . TAACTGAAAC CTAAAATAAA ACCTGCTCAT 180 



GCACC ATG GTT TTT CAA ACT CGG TAC CCT TCA TGG ATT ATT TTA TGC 227 
Met Val Phe Gin Thr Arg Tyr Pro Ser Trp lie lie Leu Cys 
1 5 10 

TAC ATC TGG CTG CTC CGC TTT GCA CAC ACA GGG GAG GCG CAG GCT GCG 275 
Tyr lie Trp Leu Leu Arg Phe Ala His Thr Gly, Glu Ala Gin Ala Ala 
15 20 25 30 

AAG GAA GTA CTA CTG CTG GAT TCT AAA GCA CAA CAA ACA GAG TTG GAG ' 323 

Lys Glu Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu 
35 40 45 

TGG ATT TCC TCT CCA CCC AAT GGG TGG GAA GAA ATT AGT GGT TTG GAT 371 
Trp lie Ser Ser Pro Pro Asn Gly Trp Glu Glu lie Ser Gly Leu Asp 
50 55 60 

GAG AAC TAT ACC CCG ATA CGA ACA TAC CAG GTG TGC CAA GTC ATG GAG 419 
Glu Asn Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu 
65 70 75 

CCC AAC CAA AAC AAC TGG CTG CGG ACT AAC TGG ATT TCC AAA GGC AAT 467 
Pro Asn Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn 
80 85 90 

GCA CAA AGG ATT TTT GTA GAA TTG AAA TTC ACC CTG AGG GAT TGT AAC 515 
Ala Gin Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn 
95 100 105 no 

AGT CTT CCT GGA GTA CTG GGA ACT TGC AAG GAA ACA TTT AAT TTG TAC 563 
Ser Leu Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr 
115 120 125 
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TAT TAT GAA ACA GAC TAT GAC ACT GGC AGG AAT ATA AGA GAA AAC CTC 611 
Tyr Tyr Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu 
130 135 140 

TAT GTA AAA ATA GAC ACC ATT GCT GCA GAtf GAA AGT TTT ACC CAA GGT 659 



Tyr Val Lys lie Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly 
145 150 , 155 



GAC CTT GGT GAA AGA AAG ATG AAG CTT AAC ACT GAG GTG AGA GAG ATT 707 
Asp Leu Gly Glu Arg Lys t Met Lys Leu Asn Thr Glu Val Arg Glu lie 

160 165 170 

I 

i i 

GGA CCT TTG TCC AAA AAG GGA TTC TAT CTT GCC TTT CAG GAT GTA GGG 755 
Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly 
175 180 185 190 

GCT TGC ATA GCT TTG GTT 'TCT GTC AAA GTG TAC TAC AAG AAG TGC TGG 803 
Ala Cys He Ala Leu Val Ser Val Lys 1 Val Tyr Tyr Lys Lys Cys Trp 
195 200 205 

i 

TCC ATT ATT GAG AAC TTA GCT ATC TTT CCA GAT ACA GTG ACT GGT TCA 851 
Ser He He Glu Asn Leu Ala He Phe Pro Asp' Thr Val Thr Gly Ser 
210 ( 215 220 

GAA TTT TCC TCT TTA GTC GAG GTT CGA GGG ACA TGT GTC AGC AGT GCA 899 
Glu Phe Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala 
225 230 235 

GAG GAA GAA GCG GAA AAC GCC CCC AGG ATG CfiLC ( TGC AGT GCA GAA GGA 947 
Glu Glu Glu Ala Glu Asn Ala Pro Arg Met His* Cys Ser Ala Glu Gly 

240 245 250 ; 

GAA TGG TTA GTG CCC ATT GGA AAA TGT ATC TGC AAA GCA GGC TAC CAG 995 
Glu Trp Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin 
255 260 265 270 

CAA AAA GGA GAC ACT TGT GAA CCC TGT GGC CGT GGG TTC TAC AAG TCT 1043 
Gin Lys Gly Asp Thr Cys Glu Pro Cys Gly Arg Gly Phe Tyr Lys . Ser 
275 280 285 

TCC TCT CAA GAT CTT CAG TGC TCT CGT TGT CCA ACT CAC AGT TTT TCT 1091 
Ser Ser Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser 
290 295 300 

GAT AAA GAA GGC TCC TCC AGA TGT GAA TGT GAA GAT GGG TAT TAC AGG 1139 
Asp Lys Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg 
305 310 315 

GCT CCA TCT GAC CCA CCA TAC GTT GCA TGC ACA AGG CCT CCA TCT GCA 1187 
Ala Pro Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala 
320 325 330 

CCA CAG AAC CTC ATT TTC AAC ATC AAC CAA ACC ACA GTA AGT TTG GAA 1235 
Pro Gin Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu 
335 340 345 350 
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TGG AGT CCT CCT GCA GAC AAT GGG GGA AGA AAC GAT GTG ACC TAC AGA 
Trp Ser Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg 
355 360 365 



1283 



ATA TTG TGT AAG CGG TGC AGT TGG GAG CAG GGC GAA TGT GTT CCC TGT 
lie Leu Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys 
370 375 380 



1331 



GGG AGT- AAC ATT GGA TAC ATG CCC CAG CAG ACT 'GGA TTA GAG GAT AAC 
Gly Ser Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn 
385 390 , 395 

TAT GTC ACT GTC ATG GAC CTG CTA GCC CAC GCT AAT TAT ACT TTT GAA 
Tyr Val Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu 
400 405 410 



1379 



1427 



GTT GAA GCT GTA AAT GGA GTT TCT GAC TTA AGC CGA TCC CAG AGG CTC 
Val Glu Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu 
415 420 425 430 

TTT GCT GCT GTC AGT ATC ACC ACT GGT CAA GCA GCT CCC TCG CAA GTG 
Phe Ala Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val 
435 / 440 445 

AGC GGA GTA ATG AAG GAG AGA GTA CTG CAG CGG AGT GTC GAG CTT TCC 
Ser Gly Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Glu Leu Ser 
450 455 460 

TGG CAG GAA CCA GAG CAT CCC AAT GGA GTC ATC ACA GAA TAT GAA ATC 
Trp Gin Glu Pro Glu His Pro Asn Gly Val lie Thr Glu Tyr Glu He 
465 1470 475 



1475 



1523 



1571 



1619 



AAG TAT TAC GAG AAA GAT CAA AGG GAA CGG ACC TAC TCA ACA GTA AAA 
Lys Tyr Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Val Lys 
480 485 490 



1667 



ACC AAG TCT ACT TCA GCC TCC ATT AAT AAT CTG AAA CCA GGA ACA GTG 
Thr Lys Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val 
495 500 505 510 

TAT GTT TTC CAG ATT CGG GCT TTT ACT GCT GCT GGT TAT GGA AAT TAC 
Tyr Val Phe Gin He Arg Ala Phe Thr Ala Ala Gly Tyr Gly Asn Tyr 
515 520 525 

AGT CCC AGA CTT GAT GTT GCT ACA CTA GAG GAA GCT ACA GGT AAA ATG 
Ser Pro Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Thr Gly Lys Met 
530 535 540 



1715 



1763 



1811 



TTT GAA GCT ACA GCT GTC TCC AGT GAA CAG AAT CCT GTT ATT ATC ATT 
Phe Glu Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He 
545 550 555 



1859 



GCT GTG GTT GCT GTA GCT GGG ACC ATC ATT TTG GTG TTC ATG GTC TTT 
Ala Val Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe 
560 565 570 



1907 
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GGC TTC ATC ATT GGG AGA AGG CAC TGT GGT TAT AGC AAA GOT GAC CAA 1955 
Gly Phe lie lie Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin 
575 580 , 585 » 590 

GAA GGC GAT GAA GAG CTT TAC TTT CAT TTT AAA TTT CCA GGC ACC AAA 2003 
Glu Gly Asp Glu Glu Leu Tyr Phe His Phe Lys Phe Pro Gly Thr Lys 
595 600 605 

ACC TAC ATT GAC CCT GAA ACC TAT GAG GAC CCA AAT AGA GCT GTC CAT 2051 
Thr Tyr lie Asp Pro Glu Thr Tyr Glu Asp Prjo Asn Arg Ala Val His 
610 1 615 620 

CAA TTC GCC AAG GAG CTA GAT GCC TCC TGT ATT AAA ATT GAG CGT GTG 2099 
~~Gln Phe Ala Lys Glu Leu Asp Ala Ser Cys lie Lys lie Glu Arg~Val 
625 630 635 

ATT GGT GCA GGA GAA TTC GGT GAA GTC TGC AGT GGC CGT TTG AAA CTT 2147 
lie Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Leu 
_640 _ J545 650 

CCA GGG AAA AGA GAT GTT GCA GTA GCC ATA AAA ACC CTG AAA GTT GGT 2195 
Pro Gly Lys Arg Asp Val Ala /Val Ala lie Lys Thr Leu Lys Val Gly , 
655 660 665 670 

TAC ACA GAA AAA CAA AGG AGA GAC TTT TTG TGT GAA GCA AGC ATC ATG 2243 
Tyr Thr Glu Lys Gin Arg Arg Asp Phe Leu Cys Glu Ala Ser lie Met 
675 680 685 

GGG CAG TTT GAC CAC CCA AAT GTT GTC CAT TTG GAA GGG GTT GTT ACA 2291 
Gly Gin Phe Asp His Pro Asn *Val Val His Leu Glu Gly Val Val Thr 
690 695 700 

AGA GGG AAA CCA GTC ATG ATA GTA ATA GAG TTC ATG GAA AAT GGA GCC 2339 
Arg Gly Lys Pro Val Met lie Val lie Glu Phe Met Glu Asn Gly Ala 
705 710 715 

CTA GAT GCA TTT CTC AGG AAA CAT GAT GGG CAA TTT ACA GTC ATT CAG 2387 
Leu Asp Ala Phe Leu Arg Lys His Asp Gly Gin Phe Thr Val lie Gin 
720 725 730 

TTA GTA GGA ATG CTG AGA GGA ATT GCT GCT GGA ATG AGA TAT TTG GCT 2435 
Leu Val Gly Met Leu Arg Gly lie Ala Ala Gly Met Arg Tyr Leu Ala 
735 740 745 750 

GAT ATG GGA TAT GTT CAC AGG GAC CTT GCA GCT CGC AAT ATT CTT GTC 2483 
Asp Met Gly Tyr Val His Arg Asp Leu Ala Ala Arg Asn lie Leu Val 
755 760 765 

AAC AGC AAT CTC GTT TGT AAA GTG TCA GAT TTT GGC CTG TCC CGA GTT 2531 
Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val 
770 775 780 

ATA GAG GAT GAT CCA GAA GCT GTC TAT ACA ACT ACT GGT GGA AAA ATT 2579 
He Glu Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys He 
785 790 795 
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■ CCA GTA AGG TGG ACA GCA CCC GAA GCC ATC CAG TAC CGG AAA TTC ACA 2627 
Pro Val Arg Trp Thr Ala Pro Glu Ala lie Gin Tyr Arg Lys Phe Thr 
800 805 810 

TCA GCC AGT GAT GTA TGG, AGC TAT GGA ATA GTC ATG TGG GAA GTT ATG 2675 
Ser Ala Ser Asp Val Trp Ser Tyr Gly lie Val Met Trp Glu Val Met 
815 , ' , 820 825 . 830 ' 

i 

TCT TAT GGA GAA AGA CCT TAT TGG GAC ATG TCA, AAT CAA GAT GTT ATA 2723 
Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val lie 
835 , 840 845 

AAA GCA ATA GAA GAA GGT 'tAT CGT T,TA CCA GCA CCC ATG GAC TGC CCA 2771 
Lys Ala lie Glu Glu Gly Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro 
850 i 855 860 

GCT GGC CTT CAC CAG CTA ( ATG TTG GAT TGT TGG CAA AAG GAG CGT GCT 2819 
Ala Gly Leu His Gin Leu Met Leu Asp Cys Trp Gin Lys Glu Arg Ala 
865 870 ' 875 

GAA AGG CCA AAA TTT GAA CAG ATA GTT GGA ATT CTA GAC AAA ATG ATT 2867 
Glu Arg Pro Lys Phe Glu Gin lie Val Gly He Leu Asp Lys Met He 
880 885 ' 890 

CGA AAC CCA AAT AGT CTG AAA ACT CCC CTG GGA ACT TGT AGT AGG CCA 2915 
Arg Asn Pro Asn Ser Leu Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro 
895 900 905 910 

ATA AGC CCT CTT CTG GAT CAA AAC ACT CCT GAT TTC ACT ACC TTT TGT 2963 
He Ser Pro Leu Leu Asp Gin Asn Thr Pro Asp, Phe Thr Thr Phe Cys 
915 920 1 925 

TCA GTT GGA GAA TGG CTA CAA GCT ATT AAG ATG GAA AGA TAT AAA GAT ^Oll 
Ser Val Gly Glu Trp Leu Gin Ala He Lys Met Glu Arg Tyr Lys As© 
930 935 940 

AAT TTC ACG GCA GCT GGC TAC AAT TCC CTT GAA TCA GTA GCC AGG ATG 3059 
Asn Phe Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met 
945 950 955 

ACT ATT GAG GAT GTG ATG AGT TTA GGG ATC ACA CTG GTT GGT CAT CAA 3107 
Thr He Glu Asp Val Met Ser Leu Gly He Thr Leu Val Gly His Gin 
960 965 970 

AAG AAA ATC ATG AGC AGC ATT CAG ACT ATG AGA GCA CAA ATG CTA CAT 3155 
Lys Lys He Met Ser Ser lie Gin Thr Met Arg Ala Gin Met Leu His 
"5 980 985 990 

TTA CAT GGA ACT GGC ATT CAA GTG TGATATGCAT TTCTCCCTTT TAAGGGAGAT 3209 
Leu His Gly Thr Gly He Gin Val 
995 

TACAGACTGC AAGAGAACAG TACTGGCCTT CAG T AT ATGC ATAGAATGCT GCTAGAAGAC 3269 
AAGTGATGTC CTGGGTCCTT CCAACAGTGA AG AG AAG AT T TAAGAAGCAC CTATAGACTT 3329 



GAACTCCTAA GTGCCACGAG AATATATAAA AAGGGAATTT AGGATCCACC ATCGGTGGCC 



3389 



i 
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AGGAAAATAG 


CAGTGACAAT 


AAACAAAGTA 


CTACCTGAAA 


AACATCCAAA 


CACCTTGAGC 


3449 


TCTCTAACCT 


CCTTTTTGTC 


TTATAGACTT* TTTAAAATGT 


ACATAAAGAA 


TTTAAGAAAG 


3509 


AATATATTTG 


TCAAATAAAA 


TCATGATCTT 


ATTGTTAAAA 


TTAATGAAAT 


ATTTTCCTTA 


3569 


AATATGTGAT 


TTCAGACTAT 


TCCTTTTTAA 


AATCATTTGT 


GTTTATTCTT 


CATAAGGACT 


3629 


TTGTTTTAGA 


AAGCTGTTTA 


TAGCTTTGGA 


CCTTTTTAGT 


GTTAAATCTG 


TAACATTACT 


3689 


ACACTGGGTA 


CCTT TGAAAG 


AATCTCAAAT 


TTCAAAAGAA 


ATAGCATGAT 


TGAAGATACA 


3749 


TCTCTGTTAG 


AACATTGGTA 


TCCTTTTTGT 


GCCATTTTAT- 


TCTGTTTAAT 


CAGTGCTGTT 


3809 


TTGATATTGT 


TTGCTAATTG 


GCAGGTAGTC 


AAGAAAATGC 


AAGTTGCCAA 


GAGCTCTGAT 


3869 


ATTTTTTAAA 


AAGAATTTTT 


TTGTAAAGAT 


CAGACAACAC 


ACTATCTTTT 


CAATGAAAAA 

i 


3929 


AGCAATAATG 


ATCCATACAT 


ACTATAAGGC 


ACTTTTAACA 


GATTGTTTAT 


AGAGTGATTT 


3989 


TACTAGAAAG 


AATTTAATAA 


ACTCGAAGTT 


TAGGTTTATG 


AGTATATAAA 


CAAATGAGGC 


4049 


ACTTCATCTG 


AAGAATGTTG 


GTGAAGGCAA 


GTCTCTGAAA 


GCAGAACTAT 


CCAGTGTTAT ' 


4109 


CTAAAAATTA 


ATCTGAGCAC 


ATCAAGATTT 


TTTCATTCTC 


vjiCjAvJATTAb 


ViAAAT T T ALKi 




ATAAATAGTT 


GACATATATT 


TTATATCCTC 


TTCTGTTGAA 


TGCAGTCCAA 


ACATGAAAGG 


4229 


AAATAATTGT 


TTTATATTAT 


AACTCTGAAG 

1 


CATGATAAAG 


GGGCAGTTGA 


CAATTTTCAC 


4289 


CATTTAAACA 


CAAATTTGCT 


GCACAGAATA 


TCACCATTGC 


AGTTCAAAAC 


AAAACAAAAC 


4349 


AAAAAGTCTT 


TTGTTTGTGA 


ACACTGATGC 


AAGAAACTTG 


TTAAATGAAA 


GGACTCTTTA 


4409 


CCCTAGAAGG 


AAGAGGTGAA 


GGATCTGGCT 


TGTTTTTAAA 


GCTTTATTTA 


TTAAACCATA 


4469 


TTATTTGATT 


ACTGTGTTAG 


AATTTCATAA 


GCAATAATTA 


AATGTGTCTT 


TATGGAATTC 


4529 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH : 998 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Val Phe Gin Thr Arg Tyr Pro Ser Trp lie lie Leu Cys Tyr He 
1 5 10 15 

Trp Leu Leu Arg Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 
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Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Asn Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 t 55 60 

i . 
Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 i 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 , 90 95 

Arg He Phe Val Glu Leu' Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

i 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 ( 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 y 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Ardf Glu He Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

He Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Ser He 
195 200 205 

He Glu Asn Leu Ala He Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ala Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Gly Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Lys 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 
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Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly t Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys ,Val Pro Cys Gly Ser 
370 375 , 380 

Asn He Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

I 

( i 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
^05 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 

420 ' 425 430 

i 

Ala Val Ser lie Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Glu Leu Ser Trp Gin 
450 455 460 

1 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu lie Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Val Lys Thr Lys 
465 490 ' ( 495 

* i 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Phe Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Thr Gly Lys Met Phe Glu 
530 535 . 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 
565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr 
595 600 605 

He Asp Pro Glu Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe 
610 615 620 

Ala Lys Glu Leu Asp Ala Ser Cys He Lys lie Glu Arg Val He Gly 
625 630 635 640 
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Ala Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly 
645 650 655 

Lys Arg Asp Val Ala Val Ala lie Lys Thr Leu Lys Val Gly Tyr Thr 
660 665 670 

Glu Lys Gin Arg Arg Asp Phe Leu Cys Glu Ala Ser lie Met Gly Gin 
675 680 685 

Phe Asp His Pro Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly 
690 695 , 700 

Lys Pro Val Met lie Val lie Glu Phe Met Glu Asn Gly Ala Leu Asp 
7 <>5 710 715 720 

Ala Phe Leu Arg Lys His Asp Gly Gin Phe Thr Val He Gin Leu Val 
725 730 735 

Gly Met Leu Arg Gly He Ala Ala Gly Met Arg Tyr Leu Ala Asp Met 
740 745 750 

Gly Tyr Val His Arg Asp Leu Aia Ala Arg Asn He Leu Val Asn Ser 
755 ,760 765 

Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val He Glu 
770 775 780 

Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys He Pro Val 
785 790 795 800 

Arg Trp Thr Ala Pro Glu Ala I He Gin Tyr Arg Lys Phe Thr Ser Ala 
805 810 8l5 

Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu Val Met Ser Tyr 
820 825 830 

Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val He Lys Ala 
835 840 845 

He Glu Glu Gly Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Glv 
850 855 860 

Leu His Gin Leu Met Leu Asp Cys Trp Gin Lys Glu Arg Ala Glu Ara 
865 870 875 880 

Pro Lys Phe Glu Gin He Val Gly He Leu Asp Lys Met He Arg Asn 
885 890 895 

Pro Asn Ser Leu Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro He Ser 
900 905 910 

Pro Leu Leu Asp Gin Asn Thr Pro Asp Phe Thr Thr Phe Cys Ser Val 
915 920 925 

Gly Glu Trp Leu Gin Ala He Lys Met Glu Arg Tyr Lys Asp Asn Phe 
930 935 940 
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Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr lie 
945 950 955 960 

j 

Glu Asp Val Met Ser Leu Gly lie Thr Leu Val Gly His Gin Lys Lys 
965 970 975 

lie Met Ser Ser lie Gin Thr Met Arg Ala Gin Met Leu His Leu His 
980 985 990 

t 

Gly Thr Gly He Gin Val 
995 



<2> INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 976 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Glu Leu Gin Ala Ala Arg Ala Cys Phe Ala Leu Leu Trp Gly Cys 
15 10 15 

t 

Ala Leu Ala Ala Ala Ala Ala Ala Gin Gly Lys Glu Val Val Leu Leu 
20 25 30 

Asp Phe Ala Ala Ala Gly Gly Glu Leu Gly Trp Leu Thr His Pro Tyr 
35 40 45 

Gly Lys Gly Trp Asp Leu Met Gin Asn He Met Asn Asp Met Pro He 
50 55 60 

Tyr Met Tyr Ser Val Cys Asn Val Met Ser Gly Asp Gin Asp Asn Trp 
65 70 75 80 

Leu Arg Thr Asn Trp Val Tyr Arg Gly Glu Ala Glu Arg Asn Asn Phe 
85 90 95 

Glu Leu Asn Phe Thr Val Arg Asp Cys Asn Ser Phe Pro Gly Gly Ala 
100 105 110 

Ser Ser Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Ala Glu Ser Asp Leu 
115 120 125 

Asp Tyr Gly Thr Asn Phe Gin Lys Arg Leu Phe Thr Lys He Asp Thr 
130 135 140 

He Ala Pro Asp Glu He Thr Val Ser Ser Asp Phe Glu Ala Arg His 
145 150 155 160 
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Val Lys Leu Asn Val Glu Glu Arg Ser Val Gly Pro Leu Thr Arg Lys 
165 170 175 

Gly Phe Tyr Leu Ala Phe Gin Asp lie Gly Ala Cys Val Ala Leu Leu 
180 185 190 . 

' » 

Ser Val Arg Val Tyr Tyr Lys Lys Cys Pro' Glu Leu Leu Gin Gly Leu 
195 200 205 

Ala His Phe Pro Glu Thr He Ala Gly Ser Asp Ala Pro Ser Leu Ala 
210 , 215 220 

i i 

Thr Val Ala Gly Thr Cys Val Asp His Ala Val Val Pro Pro Gly Gly 
225 ( 230 235 240 - 

Glu Glu Pro Arg Met His Cys Ala Val Asp Gly Glu Trp Leu Val Pro 
245 , 250 255 

He Gly Gin Cys Leu Cys Gin Ala Gly Tyr Glu Lys \Val Glu Asp Ala 

260 265 270 1 

Cys Gin Ala Cys Ser Pro Gly Phe Phe Lys Phe Glu Ala Ser Glu Ser 
275 280 285 

Pro Cys Leu Glu Cys Pro Glu His Thr Leu Pro Ser Pro Glu Gly Ala 
290 295 300 

Thr Ser Cys Glu Cys Glu Glu Gly Phe Phe Arg Ala Pro Gin Aso Pro 
305 310 ' ( ( 315 P 320 

Ala Ser Met Pro Cys Thr Arg Pro Pro Ser Ala Pro His Tyr Leu Thr , 
325 330 335 1 

Ala Val Gly Met Gly Ala Lys Val Glu Leu Arg Trp Thr Pro Pro Gin 
340 345 350 

Asp Ser Gly Gly Arg Glu Asp He Val Tyr Ser Val Thr Cys Glu Gin 
355 360 365 

Cys Trp Pro Glu Ser Gly Glu Cys Gly Pro Cys Glu Ala Ser Val Ara 
370 375 380 

Tyr Ser Glu Pro Pro His Gly Leu Thr Arg Thr Ser Val Thr Val Ser 
385 390 395 400 

Asp Leu Glu Pro His Met Asn Tyr Thr Phe Thr Val Glu Ala Arg Asn 
405 410 415 

Gly Val Ser Gly Leu Val Thr Ser Arg Ser Phe Arg Thr Ala Ser Val 
420 425 430 

Ser He Asn Gin Thr Glu Pro Pro Lys Val Arg Leu Glu Gly Arg Ser 
435 440 445 

Thr Thr Ser Leu Ser Val Ser Trp Ser He Pro Pro Pro Gin Gin Ser 
450 455 460 



I 



WO 95/28484 PCT/DS95/04681 

- 77 - 



Arg Val Trp Lys Tyr Glu Val Thr Tyr Arg Lys Lys Gly Asp Ser Asn 
465 470 475 480 

Ser Tyr Asn Val Arg Arg Thr Glu Gly Phe Ser Val Thr Leu Asp Asp 
485 490 495 

Leu Ala Pro Asp Thr Thr Tyr Leu Val Gin Val Gin Ala Leu Thr Gin 
500 505 ' 510 

Glu Gly Gin Gly Ala Gly Ser Lys Val His Glu Phe Gin Thr Leu Ser 
515 ' 520 525 

I 

Pro Glu Gly Ser Gly Asn Leu Ala Val He Gly Gly Val Ala Val Gly 
530 r 535 540 

Val Val Leu Leu Leu Val Leu Ala Gly Val Gly Phe Phe He His Arg 
545 ' 550 555 560 

Arg Arg Lys Asn Gin Arg Ala Arg Glh Ser Pro Glu Asp Val Tyr Phe 

565 570 575 1 

Ser Lys Ser Glu Gin Leu Lys Pro Leu L^s Thr Tyr Val Asp Pro His 
t 580 585 590 

Thr Tyr Glu Asp Pro Asn Gin Ala Val Leu Lys Phe Thr Thr Glu lie 
595 600 605 

His Pro Ser Cys Val Thr Arg Gin Lys Val He Gly Ala Gly Glu Phe 
610 615 ' 620 

i 

Gly Glu Val Tyr Lys Gly Met Leu Lys Thr Ser Ser Gly Lys Lys Glu 
625 630 635 640 1 

Val Pro Val Ala He Lys Thr Leu Lys Ala Gly Tyr Thr Glu Lys Gin 
645 650 655 

Arg Val Asp Phe Leu Gly Glu Ala Gly He Met Gly Gin Phe Ser His 
660 665 670 

His Asn He He Arg Leu Glu Gly Val He Ser Lys Tyr Lys Pro Met 
675 680 685 

Met He He Thr Glu Tyr Met Glu Asn Gly Ala Leu Asp Lys Phe Leu 
690 695 700 

Arg Glu Lys Asp Gly Glu Phe Ser Val Leu Gin Leu Val Gly Met Leu 
705 710 715 720 

Arg Gly He Ala Ala Gly Met Lys Tyr Leu Ala Asn Met Asn Tyr Val 
725 730 735 

His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val 
740 745 750 

Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu Glu Asp Asp Pro 
755 760 765 
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Glu Ala Thr Tyr Thr Thr Ser Gly Gly Lys He Pro He Arg Trp Thr 
770 775 780 

Ala Pro Glu Ala He Ser Tyr Ar ( g Lys Phe Thr Ser Ala Ser Asp Val 
785 790 795 800 

Trp Ser Phe Gly He Val Met Trp Glu Val Met Thr Tyr Gly Glu Arg 
805 810 815 

Pro Tyr Trp Glu Leu Ser Asn His Glu Val Met Lys Ala He Asn Asp 
820 825 , 830 

Gly Phe Arg Leu Pro Thr Pro Met Asp Cys Pro Ser Ala He Tyr Gin 
835 840 845 

Leu Met Met Gin Cys Trp Gin Gin Glu Arg Ala Arg Arg Pro Lys Phe 
850 855 860 

Ala Asp He Val Ser He Leu Asp Lys Leu He Arg Ala' Pro Asp Ser 
865 870 875 880 

Leu Lys Thr Leu Ala Asp Phe Asp Pro Arg Val Ser He Arg Leu Pro 
885 , 890 895 , 

Ser Thr Ser Gly Ser Glu Gly Val Pro Phe Arg Thr Val Ser Glu Trp 
900 905 910 

Leu Glu Ser He Lys Met Gin Gin Tyr Thr Glu His Phe Met Ala Ala 
915 920 925 

Gly Tyr Thr Ala He Glu! Lys Val Val Glh Met Thr Asn Asp Asp He 
930 935 940 

Lys Arg He Gly Val Arg Leu Pro Gly His Gin Lys Arg He Ala Tyr 
945 950 955 960 

Ser Leu Leu Gly Leu Lys Asp Gin Val Asn Thr Val Gly He Pro He 
965 970 975 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Met Glu Arg Arg Trp Pro Leu Gly Leu Gly Leu Val Leu Leu Leu Cys 
1 5 10 15 



WO 95/28484 PCT/US95/04681 

- 79 - « 



Ala Pro Leu Pro Pro Gly Ala Arg Ala Lys Glu Val Thr Leu Met Asp 
20 25 30 

Thr Ser Lys Ala Gin Gly Glu Leu Gly Trp Leu Leu Asp Pro Pro Lys 
35 40 45 

1 K 
i , ' 

Asp Gly Trp Ser Glu Gin Gin Gin lie Leu, Asn Gly Thr Pro Leu Tyr 
50 55 60 

Met Tyr Gin Asp Cys Pro Met Gin Gly Arg Arg Asp Thr Asp His Trp 
65 70 75 80 

i 

i f 

Leu Arg Ser Asn Trp lie Tyr Arg Gly Glu Glu Ala Ser Arg Val His 
85 90 95 

I . i 

Val Glu Leu Gin Phe Thr Val Arg Asp Cys Lys Ser Phe Pro Gly Gly 

100 1 105 110 

i 

Ala Gly Pro Leu Gly Cys Lys Glu Thr Phe Asn Leu (Leu Tyr Met Glu 

115 120 125 1 

Ser Asp Gin Asp Val Gly lie Gin Leu Arg Arg Pro Leu Phe Gin Lys 
130 ( 135 140 

Val Thr Thr Val Ala Ala Asp Gin Ser Phe Thr lie Arg Asp Leu Ala 
145 150 155 160 

Ser Gly Ser Val Lys Leu Asn Val Glu Arg Cys Ser Leu Gly Arg Leu 
165 170 175 

Thr Arg Arg Gly Leu Tyr Leu Ala Phe His Asn Pro Gly Ala Cys Val 

180 185 190 1 

Ala Leu Val Ser Val Arg Val Phe Tyr Gin Arg Cys Pro Glu Thr Leu 
195 200 205 

Asn Gly Leu Ala Gin Phe Pro Asp Thr Leu Pro Gly Pro Ala Gly Leu 
210 215 220 

Val Glu Val Ala Gly Thr Cys Leu Pro His Ala Arg Ala Ser Pro Arg 
225 230 235 240 

Pro Ser Gly Ala Pro Arg Met His Cys Ser Pro Asp Gly Glu Trp Leu 
245 250 255 

Val Pro Val Gly Arg Cys His Cys Glu Pro Gly Tyr Glu Glu Gly Gly 
260 265 270 

Ser Gly Glu Ala Cys Val Ala Cys Pro Ser Gly Ser Tyr Arg Met Asp 
275 280 285 

Met Asp Thr Pro His Cys Leu Thr Cys Pro Gin Gin Ser Thr Ala Glu 
290 295 300 

Ser Glu Gly Ala Thr lie Cys Thr Cys Glu Ser Gly His Tyr Arg Ala 
305 310 315 320 
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Pro Gly Glu Gly Pro Gin Val Ala Cys Thr Gly Pro Pro Ser Ala Pro 
325 330 335 

Arg Asn Leu Ser Phe Ser Ala Ser Gly Thr Gin Leu Ser Leu Arg Trp 
340 345 350 

* 

Glu Pro Pro Ala Asp Thr Gly Gly Arg Gin Asp Val Arg Tyr Ser Val 
355 360 365 

Arg Cys Ser Gin Cys Gin Gly Thr Ala Gin Asp Gly Gly Pro Cys Gin 

370 375 , 380 

i 

Pro Cys Gly Val Gly Val His Phe Ser Pro Gly Ala Arg Ala Leu Thr 
385 390 395 400 

Thr Pro Ala Val His Val Asn Gly Leu Glu Pro Tyr Ala Asn Tyr Thr 
405 410 415 

Phe Asn Val Glu Ala Gin Asn Gly Val Ser Gly Leu Gly ' Ser Ser Gly 
420 425 430 

His Ala Ser Thr Ser Val Ser lie Ser Met Gly His Ala Glu Ser Leu 
435 i 440 445 

Ser Gly Leu Ser Leu Arg Leu Val Lys Lys Glu Pro Arg Gin Leu Glu 
450 455 460 

Leu Thr Trp Ala Gly Ser Arg Pro Arg Ser Pro Gly Ala Asn Leu Thr 
465 470 475 480 

Tyr Glu Leu His Val Leui Asn Gin Asp Glu Glu Arg Tyr Gin Met Val 
485 490 1 495 

Leu Glu Pro Arg Val Leu Leu Thr Glu Leu Gin Pro Asp Thr Thr Tyr 
500 505 510 

lie Val Arg Val Arg Met Leu Thr Pro Leu Gly Pro Gly Pro Phe Ser 
515 520 525 

Pro Asp His Glu Phe Arg Thr Ser Pro Pro Val Ser Arg Gly Leu Thr 
530 535 540 

Gly Gly Glu lie Val Ala Val lie Phe Gly Leu Leu Leu Gly Ala Ala 
545 550 555 560 

Leu Leu Leu Gly He Leu Val Phe Arg Ser Arg Arg Ala Gin Arg Gin 
565 570 575 

Arg Gin Gin Arg His Val Thr Ala Pro Pro Met Trp He Glu Arg Thr 
580 585 590 

Ser Cys Ala Glu Ala Leu Cys Gly Thr Ser Arg His Thr Arg Thr Leu 
595 600 605 

His Arg Glu Pro Trp Thr Leu Pro Gly Gly Trp Ser Asn Phe Pro Ser 
610 615 620 



i 
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Arg Glu Leu Asp Pro Ala Trp Leu Met val Asp Thr Val He Gly Glu 
625 630 635 640 

i 

Gly Glu Phe Gly Glu Val Tyr Arg Gly Thr Leu Arg Leu Pro Ser Gin 
645 , 650 655 

Asp Cys Lys Thr Val Ala He Lys Thr Leu Lys Asp Thr Ser Pro Gly 
660 665 670 

Gly Gin Trp Trp Asn Phe Leu Arg Glu Ala Thr lie Met Gly Gin Phe 
675 . 680 ' 685 

Ser His Pro His lie Leu His Leu Glu Gly. Val Val Thr Lys Arg Lys 
690 695 700 

Pro He Met He He Thr Glu Phe Met Glu Asn Ala Ala Leu Asp Ala 
705 710 715 720 

Phe Leu Arg Glu Arg Glu Asp Gin Leu Val Pro Gly Gin Leu Val Ala 
725 730 735 

Met Leu Gin Gly He Ala Ser Gly Met Asn Tyr Leu Ser Asn His Asn 
740 1 745 750 1 

Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Gin Asn 
755 760 765 

Leu Cys Cys Lys Val Ser Asp Phe Gly Leu Thr Arg Leu Leu Asp Asp 
770 775 780 

! • ■ 

Phe Asp Gly Thr Tyr Glu Thr Gin Gly Gly Lys He Pro He Arg Trp 
785 790 795 800 

Thr Ala Pro Glu Ala He Ala His Arg He Phe Thr Thr Ala Ser Asp 
805 810 815 

Val Trp Ser Phe Gly He Val Met Trp Glu Val Leu Ser Phe Gly Asp 
820 825 830 

Lys Pro Tyr Gly Glu Met Ser Asn Gin Glu Val Met Lys Ser He Glu 
835 840 845 

Asp Gly Tyr Arg Leu Pro Pro Pro Val Asp Cys Pro Ala Pro Leu Tyr 
850 855 860 

Glu Leu Met Lys Asn Cys Trp Ala Tyr Asp Arg Ala Arg Arg Pro His 
865 870 875 880 

Phe Gin Lys Leu Gin Ala His Leu Glu Gin Leu Leu Ala Asn Pro His 
885 890 895 

Ser Leu Arg Thr He Ala Asn Phe Asp Pro Arg Val Thr Leu Arg Leu 
900 905 910 

Pro Ser Leu Ser Gly Ser Asp Gly He Pro Tyr Arg Thr Val Ser Glu 
915 920 925 
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Trp Leu Glu Ser He Arg Met Lys Arg Tyr He Leu His Phe His Ser 
930 935 940 

Ala Gly Leu Asp Thr Met Glu Cys Val Leu Glu Leu Thr Ala Glu Asp 
945 . 950 955 960 

Leu Thr Gin Met Gly He Thr Leu Pro Gly His Gin Lys Arg He Leu 



965 970 

Cys Ser He Gin Gly Phe Lys Asp 
980 } 



975 



(2) INFORMATION FOR SEQ 1 ID NO: 20: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ala Arg Ala Arg Pro Pro Pro Pro Pro Ser Pro Pro Pro Gly Leu 
1 5 10 15 

Leu Pro Leu Leu Pro Pro Leu Leu Leu Leu Pro Leu Leu Leu Leu Pro 
20 25 1 30 

Ala Gly Cys Arg Ala Leu Glu Glu Thr Leu Met Asp Thr Lys Trp Val 1 
35 40 45 

Thr Ser Glu Leu Ala Trp Thr Ser His Pro Glu Ser Gly Trp Glu Glu 
50 55 60 

Val Ser Gly Tyr Asp Glu Ala Met Asn Pro He Arg Thr Tyr Gin Val 
65 7 ° 75 80 

Cys Asn Val Arg Glu Ser Ser Gin Asn Asn Trp Leu Arg Thr Glv Phe 
65 90 95 

He Trp Arg Arg Asp Val Gin Arg Val Tyr Val Glu Leu Lys Phe Thr 
100 105 no 

Val Arg Asp Cys Asn Ser He Pro Asn He Pro Gly Ser Cys Lys Glu 
115 120 125 

Thr Phe Asn Leu Phe Tyr Tyr Glu Ala Asp Ser Asp Val Ala Ser Ala 
130 135 140 

Ser Ser Pro Phe Trp Met Glu Asn Pro Tyr Val Lys Val Asp Thr He 
145 150 155 i 6 o 

Ala Pro Asp Glu Ser Phe Ser Arg Leu Asp Ala Gly Arg Val Asn Thr 
165 ivo 175 
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Lys Val Arg Ser Phe Gly Pro Leu Ser Lys Ala Gly Phe Tyr Leu Ala 
180 185 190 

Phe Gin Asp Gin Gly Ala Cys Met Ser Leu lie Ser Val Arg 'Ala Phe 

195 200 1 205 * 

1 i ' 

Tyr Lys Lys Cys Ala Ser Thr Thr Ala Gly Phe Ala Leu Phe Pro Glu 
. 210 215 220 

Thr Leu Thr Gly Aia Glu Pro Thr Ser Leu Val lie Ala Pro Gly Thr 
225 I 230 f 235 240 

Cys lie Pro Asn Ala Val Glu Val Ser Val Pro Leu Lys Leu Tyr Cys 
' 245 250 255 

Asn Gly Asp Gly Glu Trp Met Val Pro Val Gly Ala Cys Thr Cys Ala 
260 ' 265 270 

Thr Gly His Glu Pro Ala Ala Lys Glu Ser Gin Cys^Arg Pro Cys Pro i 
275 280 285 

Pro Gly Ser Tyr Lys Ala Lys <Gln Gly Glu Gly Pro Cys Leu Pro Cys 
290 ! 295 . 300 

Pro Pro Asn Ser Arg Thr Thr Ser Pro Ala Ala Ser lie Cys Thr Cys 
305 310 315 320 

His Asn Asn Phe Tyr Arg Ala Asp Ser Asp Ser Ala Asp Ser Ala Cys 
325 336 335 

Thr Thr Val Pro Ser Pro Pro Arg Gly Val lie Ser Asn Val Asn Glu f 
340 345 350 

Thr Ser Leu lie Leu Glu Trp Ser Glu Pro Arg Asp Leu Gly Val Arg 
355 360 365 

Asp Asp Leu Leu Tyr Asn Val lie Cys Lys Lys Cys His Gly Ala Gly 
370 375 380 

Gly Ala Ser Ala Cys Ser Arg Cys Asp Asp Asn Val Glu Phe Val Pro 
385 390 395 400 

Arg Gin Leu Gly Leu Ser Glu Pro Arg Val His Thr Ser His Leu Leu 
405 410 415 

Ala His Thr Arg Tyr Thr Phe Glu Val Gin Ala Val Asn Gly Val Ser 
420 425 430 

Gly Lys Ser Pro Leu Pro Pro Arg Tyr Ala Ala Val Asn lie Thr Thr 
435 440 445 

Asn Gin Ala Ala Pro Ser Glu Val Pro Thr Leu Arg Leu His Ser Ser 
450 455 460 

Ser Gly Ser Ser Leu Thr Leu Ser Trp Ala Pro Pro Glu Arg Pro Asn 
465 470 475 480 
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Gly Val lie Leu Asp Tyr Glu Met Lys Tyr Phe Glu Lys Ser Glu Gly 
485 490 495 

He Ala Ser Thr Val Thr Ser Gin Met Asn Ser Val Gin Leu Asp Gly 
500 505 510 

Leu Arg Pro Asp Ala Arg Tyr Val Val Gin Val Arg Ala Arg Thr Val 
515 ' 520 . 525 

Ala Gly Tyr Gly Gin Tyr Ser Arg Pro Ala Glu Phe Glu Thr Thr Ser 
530 535 , 540 

Glu Arg Gly Ser Gly Ala Gin Gin Leu Gin Glu Gin Leu Pro Leu He 
545 550 555 560 

Val Gly Ser Ala Thr Ala Gly Leu Val Phe Val Val Ala Val Val Val 
565 570 575 

He Ala He Val Cys Leu Arg Lys Gin Arg His Gly Ser Asp Ser Glu 
580 585 590 

Tyr Thr Glu Lys Leu Gin Gin Tyr He Ala Pro Gly Met Lys Val Tvr 
595 , 600 605 , 

He Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu Ala Val Arg Glu Phe 
610 615 620 

Ala Lys Glu He Asp Val Ser Cys Val Lys He Glu Glu Val He Gly 
625 «30 635 640 

Ala Gly Glu Phe Gly Glii Val Cys Arg Gly Arg Leu Lys Gin Pro Gly 
"5 650 ' 655 

Arg Arg Glu Val Phe Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr 
660 665 670 

Glu Arg Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser He Met Gly Gin 
675 680 685 

Phe Asp His Pro Asn He He Arg Leu Glu Gly Val Val Thr Lvs Ser 
M0 695 700 

Arg Pro Val Met He Leu Thr Glu Phe Met Glu Asn Cys Ala Leu Asp 
705 710 715 720 

Ser Phe Leu Arg Leu Asn Asp Gly Gin Phe Thr Val He Gin Leu Val 
725 730 735 

Gly Met Leu Arg Gly He Ala Ala Gly Met Lys Tyr Leu Ser Glu Met 
7 40 745 750 

Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser 
755 760 765 

Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Phe Leu Glu 
770 775 780 



f 
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Asp Asp Pro Ser Asp Pro Thr Tyr Thr Ser Ser lieu Gly Gly Lys lie 
785 790 795 800 

Pro lie Arg Trp Thr Ala Pro Giu Ala lie Ala Tyr Arg Lys Phe Thr 
805 810 815 

Ser Ala Ser Asp Val Trp Ser Tyr Gly lie Val Met Trp Glu Val Met 
820 825 830 

Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val lie 
835 ■ 840 ' 845 

Asn Ala Val Glu Gin Asp Tyr Arg Leu Pro Pro Pro Met Asp Cys Pro 
850 855 860 

Thr Ala Leu His Gin Leu Met Leu Asp Cys Trp Val Arg Asp Arg Asn 

865 870 875 880 

t 

Leu Arg Pro Lys Phe Ser Gin lie Val Asn Thr Leu Asp Lys Leu lie 
885 890 895 

Arg Asn Ala Ala Ser Leu Lys Val lie Ala Ser Ala Gin Ser Gly ^iet 
900 905 910 

Ser Gin Pro Leu Leu Asp Arg Thr Val Pro Asp Tyr Thr Thr Phe Thr 
915 920 925 

Thr Val Gly Asp Trp Leu Asp Ala lie Lys Met Gly Arg Tyr Lys Glu 
930 935 940 

I 

Ser Phe Val Ser Ala Gly Phe Ala Ser Phe Asp Leu Val Ala Gin Met 
945 950 955 960 

Thr Ala Glu Asp Leu Leu Arg lie Gly Val Thr Leu Ala Gly His Gin 
965 970 975 

Lys Lys lie Leu Ser Ser lie Gin Asp Met Arg Leu Gin Met Asn Gin 
980 985 990 

Thr Leu Pro Val Gin Val 
995 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) * SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Asp Cys Gin Leu Ser lie Leu Leu Leu Leu Ser Cys Ser Val Leu 
15 10 15 
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Asp Ser Phe Gly Glu Leu lie Pro Gin Pro Ser Asn Glu Val Asn Leu 
20 25 30 

Leu Asp Ser Lys Thr, lie Gin Gly Glu Leu Gly Trp lie Ser Tyr Pro 
35 40 ( 45 

\ 

1 i 

Ser His Gly Trp Glu Glu lie Ser Gly Val , Asp Glu His Tyr Thr Pro 
50 55 60 

lie Arg Thr Tyr Gin Val Cys Asn Val Met Asp His Ser Gin Asn Asn 
65 70 75 80 

i t 

Trp Leu Arg Thr Ash Trp Val Pro Arg Asn Ser Ala Gin Lys lie Tyr 
,85 .90 95 

Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser lie Pro Leu Val 

100 ' 105 110 

i 

Leu Gly, Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Met Glu Ser Asp 
115 120 125 

Asp Asp His Gly Val Lys Phe Arg Glu His Gin Phe Thr Lys lie Asp 
130 ( 135 140 

Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Met Asp Leu Gly Asp Arg 
145 150 155 160 

lie Leu Lys Leu Asn Thr Glu lie Arg Glu Val Gly Pro Val Asn Lys 
165 1^70 175 

Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys Val Ala Leu 
180 185 190 

Val Ser Val Arg Val Tyr Phe Lys Lys Cys Pro Phe Thr Val Lys Asn 
195 200 205 

Leu Ala Met Phe Pro Asp Thr Val Pro Met Asp Ser Gin Ser Leu Val 
210 215 220 

Glu Val Arg Gly Ser Cys Val Asn Asn Ser Lys Glu Glu Asp Pro Pro 
225 230 235 240 

Arg Met Tyr Cys Ser Thr Glu Gly Glu Trp Leu Val Pro lie Gly Lys 
245 250 255 

Cys Ser Cys Asn Ala Gly Tyr Glu Glu Arg Gly Phe Met Cys Gin Ala 
260 265 270 

Cys Arg Pro Gly Phe Tyr Lys Ala Leu Asp Gly Asn Met Lys Cys Ala 
275 280 285 

Lys Cys Pro Pro His Ser Ser Thr Gin Glu Asp Gly Ser Met Asn Cys 
290 295 300 

Arg Cys Glu Asn Asn Tyr Phe Arg Ala Asp Lys Asp Pro Pro Ser Met 
305 310 315 320 
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Ala Cys Thr Arg Pro Pro Ser Ser Pro Arg Asn Val He Ser Asn He 
325 330 335 

Asn Glu Thr Ser Val He Leu Asp Trp Ser Trp Pro Leu Asp Thr Gly 
340 ,345 350 

Gly Arg Lys Asp Val Thr Phe Asn He He Cys Lys Lys Cys Gly Trp 
355 360 365 

Asn He Lys Gin Cys Glu Pro Cys Ser Pro Asn Val Arg Phe Leu Pro 
370 375 ' 380 

Arg Gin Phe Gly Leu Thr Asn Thr Thr Val Thr Val Thr Asp Leu Leu 
385 390 395 400 

Ala His Thr Asn Tyr Thr Phe Glu He Asp Ala Val Asn Gly Val Ser 
405 410 415 

! 

Glu Leu Ser Ser Pro Pro Arg Gin Phe Ala Ala Val Ser He Thr Thr 
420 425 430 

Asn Gin Ala Ala Pro Ser , Pro Val Leu Thr He Lys Lys Asp Arg TJir 
435 440 445 

Ser Arg Asn Ser He Ser Leu Ser Trp Gin Glu Pro Glu His Pro Asn 
450 455 460 

Gly He He Leu Asp Tyr Glu Val Lys Tyr Tyr Glu Lys Gin Glu Gin 
465 470 475 480 

I 

Glu Thr Ser Tyr Thr lie Leu Arg Ala Arg Gly Thr Asn Val Thr He 
485 490 495 

Ser Ser Leu Lys Pro Asp Thr He Tyr Val Leu Gin He Arg Ala Arg 
500 505 510 

Thr Ala Ala Gly Tyr Gly Thr Asn Ser Arg Lys Phe Glu Phe Glu Thr 
515 520 525 

Ser Pro Asp Ser Phe Ser He Ser Gly Glu Ser Ser Gin Val Val Met 
530 535 540 

He Ala He Ser Ala Ala Val Ala He lie Leu Leu Thr Val Val He 
545 550 555 560 

Tyr Val Leu lie Gly Arg Phe Cys Gly Tyr Lys Ser Lys His Gly Ala 
565 570 575 

Asp Glu Lys Arg Leu His Phe Gly Asn Gly His Leu Lys Leu Pro Gly 
580 585 590 

Leu Arg Thr Tyr Val Asp Pro His Thr Tyr Glu Asp Pro Thr Gin Ala 
595 600 605 

Val His Glu Phe Ala Lys Glu Leu Asp Ala Thr Asn He Ser lie Asp 
610 615 620 
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Lys Val Val Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu 
625 630 635 640 

Lys Leu Pro Ser Lys Lys Glu He Ser Val Ala lie Lys Thr Leu Lys 
645 650 655 

Val Gly ,Tyr |Thr Glu Lys Gin Arg Arg Asp Phe, Leu Gly Glu Ala *Ser 
660 665 i 670 

He Met Gly Gin Phe Asp His Pro Asn lie He Arg Leu Glu Gly Val 
675 , 680 1 685 

Val Thr Lys Ser Lys Pro Val, Met He Val Thr Glu Tyr Met Glu Asn 
690 695 700 

Gly Ser Leu Asp Ser Phe Leu Arg Lys His Asp Ala Gin Phe Thr Val 

705 710 715 720 

i 

He Gin Leu Val Gly Met Leu Arg Gly He Ala Ser Gly Met Lys Tyr 
725 730 . 735 

K i 

Leu Ser Asp Met Gly Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 
740 745 ' 750 

Leu He dsn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser 
755 760 765 

Arg Val Leu Glu Asp Asp Pro Glu Ala Ala Tyr Thr Thr Arg Gly Gly 
770 775 780 

Lys He Pro He Arg Trp Thr Ser Pro Glu Ala He Ala Tyr Arg Lys 
785 790 795 # 800 ( 

Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly He Val Leu Trp Glu 
805 810 815 

Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Glu Met Ser Asn Gin Asp 
820 825 830 

Val He Lys Ala Val Asp Glu Gly Tyr Arg Leu Pro Pro Pro Met Asp 
835 840 845 

Cys Pro Ala Ala Leu Tyr Gin Leu Met Leu Asp Cys Trp Gin Lys Asp 
850 855 860 

Arg Asn Asn Arg Pro Lys Phe Glu Gin He Val Ser He Leu Asp Lys 
865 870 875 880 

Leu He Arg Asn Pro Gly Ser Leu Lys He He Thr Ser Ala Ala Ala 
885 890 895 

Arg Pro Ser Asn Leu Leu Leu Asp Gin Ser Asn Val Asp He Ser Thr 
900 905 910 

Phe Arg Thr Thr Gly Asp Trp Leu Asn Gly Val Arg Thr Ala His Cys 
915 920 925 
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Lys Glu He Phe Thr Gly Val Glu Tyr Ser Ser Cys Asp Thr He Ala 
930 935 940 

Lys lie Ser Thr Aso Asp Met Lys Lys Val Gly Val Thr Val Val Gly 
945 950 955 960 

Pro Gin Lys iys He He Ser Ser He Lys Ala Leu Glu Thr Gin Ser 
965 970 975 

Lys Asn Gly Pro Val Pro Val 
980 1 

I 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single' 

(D) TOPOLOGY: linear k 
(ii) MOLECULE TYPE: CDNA 

i 

t 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



CTQCTCGCCG CCGTGGAAGA AACG 24 
(2) INFORMATION FOR SEQ ID NO: 23: 



i 



(i); - SEQUENCE CHARACTERISTICS: 
5^(MA) LENGTH: 39 base pairs 
V V(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GCGTCTAGAT TATCACTTCT CCTGGATGCT TGTCTGGTA 39 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCGGACGCCG CCGCCATGGC CCTGGATTGC CTGCTGCTGT TCCTCCTG 
(2) INFORMATION FOR SEQ ID NO: 25: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear , 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGTTTCTTCC ACGGCGGCGA GCAGAGATGC CAGGAGGAAC AGCAGCAGGC AATC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

! 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 6: 

Met Ala Leu Asp Cys Leu Leu Leu Phe Leu Leu Ala Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AGGGAATTCC AYCGNGAYYT NGCNGC 



26 
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(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

24 



/ 



i 
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WHAT IS CLAIMED IS: 



10 



15 



20 



1. An isolated nucleic acid encoding a 
polypeptide having at least one of the biological 
activities of an EPH-like receptor protein tyrosine 
kinase, the nucleic acid selected from the group 
consisting of: 

(a) , the nucleic acids set forth in any of SEQ 
ID NO: 10, SEQ ID NO: 'l2, SEQ ID NO: 14 or SEQ ID NO: 16 
and their complementary strands; 

(b) ; a nucleic acid hybridizing to the coding 
regions of the nucleic icids in any of SEQ ID NO: 10, 
SEQ ID NO: 12, SEQ ID NO: 14 or SEQ ID NO: 16; and , 

(c) a nucleic acid of (b) which, but for the 
degeneracy of the genetic code, would hybridize to the 
coding regions of the nucleic acids in any of SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14 or SEQ ID NO: 16. 

2. A s polypeptide product Q f expression of a 
nucleic acid of Claim 1 in a procaryotic or eucaryotic 
host cell. ! 



25 



30 



3. 

human origin, 



A nucleic acid of Claim 1 which is of 



4. A nucleic acid of Claim 1 which encodes 
polypeptide having part or all of the amino acid 
sequence as shown in any of SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 14 or SEQ ID NO: 16. 

5. A nucleic acid of Claim 1 encoding a 
fragment comprising an EPH-like receptor extracellular 
domain . 



35 6. A nucleic acid of Claim 1 which is cDNA, 

genomic DNA, synthetic DNA or RNA. 
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7. A nucleic acid of Claim 1 which includes 
one or more codons preferred for expression in E. cdli 
host cells, ' * 

5 

8. A nucleic acid of Claim 1 which includes 
one or more cbdon preferred for expression in mammalian 
cells. 1 i 1 

i . • 

10 9. A nucleic acid encoding amino acids 6-524 

as set forth in SEQ ID 1^0: 10, and optionally encoding 

an amino terminal methionyl residue. 

10. A nucleic acid encoding amino acids 1-547 
15 as set 1 forth in SEQ ID NO: 12, and . optionally encoding 

an amino acid terminal methionyl residue. 

11. A nucleic acid t encoding amino acids 21- 
547 as set forth in SEQ ID NO: 'l4, and optionally 

20 encoding an amino terminal methionyl residue. 1 

12. A nucleic acid encoding amino acids 23- 
553 as set forth in SEQ ID NO: 16, and optionally 
encoding an amino terminal methionyl residue. 



25 
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13. A nucleic acid encoding a chimeric 
protein, wherein the protein comprises an EPH-like 
receptor extracellular domain fused to a heterologous 
receptor cytoplasmic domain. 

14. A nucleic acid of Claim 13 wherein the 
extracellular domain is selected from the group 
consisting of HEK5, HEK7, HEK8 and HEK11 extracellular 
domains . 



35 
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15. A biologically functional plasmid or 
viral DNA vector including a nucleic acid of Claim 1. 

i 

16. A procaryotic or eucaryotic host cell 
stably transformed or transfected with the plasmid of 
Claim 15. 



i 



17. A method of producing an EPH-like 
receptor protein tyrosine kinase comprising culturing 
the host cell of Claim 16 to allow the host cell to 
express the EPH-like receptor protein tyrosine kinase. 

18. An isolated polypeptide having an amino 
acid sequence as shown in any of SEQ ID NO: 10, SEQ ID 
NO: 12, SEQ ID NO: ' 14 or SEQ ID NO: 16, or a fragment or 
analog thereof, wherein the polypeptide has at least one 
of the biological activities of an EPH-like receptor 
protein tyrosine kinase. 



19. 


Purified 


and 


isolated HEK5 receptor. 


20. 


Purified 


and 


isolated HEK7 receptor. 


21. 


Purified 


and 


isolated HEK8 receptor. 


22. 


Purified 


and 


isolated HEK11 receptor. 


23. 


A polypeptide 


of Claim 18 wherein the 



biological activity is the binding of a ligand 
30 

24. A polypeptide of Claim 18 which is of 
human origin. 

25. A polypeptide of Claims 18 characterized 
35 by being the product of procaryotic or eucaryotic 

expression of an exogenous DNA sequence. 
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26. A polypeptide of Claim 25 wherein the 
exogenous DNA is a cDNA. 

1 i 
5 27. A polypeptide of' Claim 25 wherein the 

exogenous DNA is a genomic DNA. 

28. 1 An antibody ior fragment thereof 
specifically binding a polypeptide of Claim 18. 

10 

29. An antibody of Claim 28 which is a 
monoclonal antibody. v 

30. A pharmaceutical composition comprising a 
15 therapeutically effective amount of a polypeptide of 

Claim 18 in a mixture with a pharmaceutically acceptable 
adjuvant , carrier, solubilizer or diluent. 

t 

31. A pharmaceutical composition comprising a 
20 therapeutically effective amount of an antibody of Claim 

28 in a mixture with a pharmaceutically acceptable 
adjuvant, carrier, solubilizer or diluent. 

32. A method for modulating the endogenous 
25 activation of an EPH-like receptor protein tyrosine 

kinase comprising administering an effective amount of a 
polypeptide of Claim 18. 

33. A method for modulating the synthesis of 
30 an EPH-like receptor protein tyrosine kinase comprising 

hybridizing an antisense oligonucleotide to a nucleic 
acid of Claim 1. 



BNSDOCID: <WO 9528484 A 1J_> 



J 



WO 95/28484 PCT/DS95/04681 

- 96 - 

34 • A method of identifying a ligand that 
binds to a receptor polypeptide of Claim 18 comprising 
the steps of: * ' 

a) exposing at . least one molecule to the 
5 receptor polypeptide for a time sufficient to allow 

formation of a receptor/ ligand complex; 

i 

b) removing non-complexed molecules; and 

c) detecting the presence of the molecule 
bound to the receptor polypeptide. 

10 



i 



WO 95/28484 PCT/DS95/54681 



1/33 

FIG. IA 

CTG CTC GCC GCC GTG GAA GAA ACG CTA ATG GAC TCC ACT ACA GCG ACT 
Leu Leu Ala Ala Val Glu Glu Thr Leu Met Asp .Ser Thr Thr Ala Thr 
1 5 ' 10 15 
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GCT GAG CTG GGC TGG ATG GTG CAT CCT CCA TCA GGG TGG GAA GAG GTG 
Ala Glu Leu Gly Tip Met Val His Pro Pro Ser Gly Tip Glu Glu Val 
20 25 30 

t 

AGT GGC TAC GAT GAG AAC ATG AAC ACG ATC CGC ACG TAC CAG GTG TGC 
Ser Gly Tyr Asp Glu Asn Met Asn Thr lie Arg Thr Tyr Gin Val Cys 
35 40 45 



96 



144 



AAC GTG TTT GAG TCA AGC CAG AAC AAC TGG CTA CGG ACC AAG TTT ATC 
Asn Val Phe Glu Ser Ser Gin Asn Asn Trp Leu Arg Thr Lys Phe lie 
50 55 60 



192 



CGG CGC CGT GGG GCC CAC CGC ATC CAC GTG GAG ATG AAG TTT TCG GTG 
Arg Arg Arg Gly Ala His Arg, lie His Val Glu Met Lys Phe Ser Val 



65 



70 



75 



80 



240 



CGT GAC TGC AGC AGC ATC CCC AGC GTG CCT GGC TCC TGC AAG GAG ACC 
Arg Asp Cys Ser Ser lie Pro Ser Val Pro Gly Ser Cys Lys Glu Thr 

85 90 95 



288 



TTC AAC CTC TAT TAC TAT GAGi GCT GAC TTT GAC TCG GCC ACC AAG ACC 
Phe Asn Leu Tyr Tyr Tyr Glu Ala Asp Phe Asp Ser Ala Thr Lys Thr 
100 105 110 



336 



TTC CCC AAC TGG ATG GAG AAT CCA TGG GTG AAG GTG GAT ACC ATT GCA 
Phe Pro Asn Trp Met Glu Asn Pro Trp Val Lys Val Asp Thr lie Ala 
115 120 125 



384 



GCC GAC GAG AGC TTC TCC CAG GTG GAC CTG GGT GGC CGC GTC ATG AAA 
Ala Asp Glu Ser Phe Ser Gin Val Asp Leu Gly Gly Arg Val Met Lys 
130 135 140 



432 



ATC AAC ACC GAG GTG CGG AGC TTC GGA CCT GTG TCC CGC AGC GGC TTC 
lie Asn Thr Glu Val Arg Ser Phe Gly Pro Val Ser Arg Ser Gly Phe 
145 150 155 160 



480 



TAC CTG GCC TTC CAG GAC TAT GGC GGC TGC ATG TCC CTC ATC GCC GTG 
Tyr Leu Ala Phe Gin Asp Tyr Gly Gly Cys Met Ser Leu lie Ala Val 
165 170 " 175 



528 



CGT GTC TTC TAC CGC AAG TGC CCC CGC ATC ATC CAG AAT GGC GCC ATC 
Arg Val Phe Tyr Arg Lys Cys Pro Arg lie lie Gin Asn Gly Ala lie 
180 185 190 



576 
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FIG. IB 

TTC CAG GAA ACC CTG TCG GGG GCT GAG AGCACA TCG CTG GTG GCT GCC 
Phe Gin Glu Thr Leu Ser Gly Ala Glu Ser Thr Ser Leu Val Ala Ala 
195 200 205 



624 



CGG GGC AGC TGC ATC GCC AAT GCG GAA GAG GTG GAT GTA CCC ATC AAG 
Arg Gly Ser Cys He Ala Asn Ala Glu Glu Val Asp Val Pro He Lvs 
210 215 ' 220 

CTC TAC TGT AAC GGG GAC GGC GAG TGG CTG GTG CCC ATC GGG CGC TGC 
Leu Tyr Cys. Asn Gly Asp Gly Glu Trp Leu Val Pro lie Gly Arg Cys 
225 2 30 ' ' 235 240 

ATG TGC AAA GCA' GGC TTC GAG GCC GTT GAG AAT GGC ACC GTC TGC CGA 
Met Cys Lys Ala Gly Phe Glu Ala Val Glu Asn Gly Thr Val Cys Arg 
245 , 250 255 

GGT TGT CCA TCT GGG ACT TTC AAG GCC AAC CAA GGG GAT GAG GCC TGT 
Gly Cys Pro Ser Gly Thr Phe Lys Ala Asn Gin Gly Asp Glu Ala Cys 
260 ,265 270 

i 

ACC CAC TGT CCC ATC AAC AGC CGG ACC ACT TCT GAA GGG GCC ACC AAC 
Thr His Cys Pro He Asn Ser Arg Thr Thr Ser Glu Gly Ala Thr Asn 
275 280 285 

TGT GTC TGC CGC AAT GGC TAC TAC AGA GCA GAC CTG GAC CCC CTG GAC 
Cys Val Cys Arg Asn Gly Tyr Tyr Arg Ala Xsp Leu Asp Pro Leu Asp 
290 295 300 

ATG CCC TGC ACA ACC ATC CCC TCC GCG CCC CAG GCT GTG ATT TCC AGT 
Met Pro Cys Thr Thr He Pro Ser Ala Pro Gin Ala Val He Ser Ser 
305 310 315 320 

GTC AAT GAG ACC TCC CTC ATG CTG GAG TGG ACC CCT CCC CGC GAC TCC 
Val Asn Glu Thr Ser Leu Met Leu Glu Trp Thr Pro Pro Arg Asp Ser 
325 330 335 

GGA GGC CGA GAG GAC CTC GTC TAC AAC ATC ATC TGC AAG AGC TGT GGC 
Gly Gly Arg Glu Asp Leu Val Tyr Asn He He Cys Lys Ser Cys Gly 
340 345 350 

TCG GGC CGG GGT GCC TGC ACC CGC TGC GGG GAC AAT GTA CAG TAC GCA 
Ser Gly Arg Gly Ala Cys Thr Arg Cys Gly Asp Asn Val Gin Tyr Ala 
355 360 365 

CCA CGC CAG CTA GGC CTG ACC GAG CCA CGC ATT TAC ATC AGT GAC CTG 
Pro Arg Gin Leu Gly Leu Thr Glu Pro Arg He Tyr He Ser Asp Leu 
370 375 380 



672 



720 



768" 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



CTG GCC CAC ACC CAG TAC ACC TTC GAG ATC CAG GCT GTG AAC GGC GTT 1200 
Leu Ala His Thr Gin Tyr Thr Phe Glu He Gin Ala Val Asn Gly Val 
385 390 395 400 
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ACT GAC CAG AGC CCC TTC TCG CCT CAG TTC GCC TCT GTG AAC ATC ACC 1248 
Thr Asp Gin Ser Pro Phe Ser Pro Gin Phe Ala Ser Val Asn He Thr 
405 410 415 

ACC AAC CAG GCA GOT CCA TCG GCA GTG TCC ATC ATG CAT CAG GTG AGC 1296 
Thr Asn Gin Ala Ala Pro Ser Ala Val Ser He Met His Gin Val Ser 
4'20 i 425 430 

CGC ACC GTG GAC AGC ATT ACC CTG TCG TGG TCC CAG CCG GAC CAG CCC 1344 
Arg Thr Val Asp Ser He, Thr Leu Ser Trp Ser Gin Pro Asp Gin Pro 
435 440' 1 445 

AAT GGC GTG ATC 1 CTG GAC TAT GAG CTG CAG TAC TAT GAG AAG GAG CTC ' 1392 
Asn Gly Val He Leu Asp Tyr Glu Leu Gin Tyr Tyr Glu Lys Glu Leu 
450 455 , 460 

AGT GAG TAC AAC GCC ACA GCC ATA AAA AGC CCC ACC AAC: ACG GTC ACG 
Ser Glu Tyr Asn Ala Thr Ala lie Lys Ser Pro Thr Asn Thr Val Thr 
465 470 475 48,0 

GGC CTC AAA GCC GGC GCC ATC TAT GTC TTC CAG GTG CGG GCA CGC ACT 1488 
Gly Leu Lys Ala Gly Ala He Tyr Val Phe Gin Val Arg Ala Arg Thr 
485 490 495 

GTG GCA GGC TAC GGG CGC TAC AGC GGC AAG ATG TAC TTC CAG ACC ATG 1536 
Val Ala Gly Tyr Gly Arg Tyr Ser Gly Lys Met Tyr Phe Gin Thr Met 

500 505 510 i 

ACA GAA GCC GAG TAC CAG ACA AGC ATC CAG GAG AAG TTG CCA CTC ATC 1584 
Thr Glu Ala Glu Tyr Gin Thr Ser He Gin Glu Lys Leu Pro Leu He 
515 520 525 

ATC GGC TCC TCG GCC GCT GGC CTG GTC TTC CTC ATT GCT GTG GTT GTC 1632 
He Gly Ser Ser Ala Ala Gly Leu Vai Phe Leu He Ala Val Val Val 
530 535 540 

ATC GCC ATC GTG TGT AAC AGA CGG GGG TTT GAG CGT GCT GAC TCG GAG 1680 
He Ala He Val Cys Asn Arg Arg Gly Phe Glu Arg Ala Asp Ser Glu 
545 550 555 560 

TAC ACG GAC AAG CTG CAA CAC TAC ACC AGT GGC CAC ATA ACC CCA GGC 1728 
Tyr Thr Asp Lys Leu Gin His Tyr Thr Ser Gly His He Thr Pro Gly 
565 570 575 

ATG AAG ATC TAC ATC GAT CCT TTC ACC TAC GAG GAC CCC AAC GAG GCA 1776 
Met Lys He Tyr He Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu Ala 
580 585 590 

GTG CGG GAG TTT GCC AAG GAA ATT GAC ATC TCC TGT GTC AAA ATT GAG 1824 
Val Arg Glu Phe Ala Lys Glu He Asp He Ser Cys Val Lys He Glu 
595 600 605 
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FIG. ID 

CAG GTG ATC GGA GCA GGG GAG TTT GGC GAG GTC TGC AGT GGC CAC GTG 1872 
Gin Val He Gly Ala Gly Glu Phe Gly Glu Val Cys Ser Gly His Leu 
610 615 620 

AAG CTG CCA GGC AAG AGA GAG ATC TTT GTG GCC ATC AAG ACG CTC AAG 1920 
Lys Leu Pro Gly Lys Arg Glu He Phe Val Ala He Lys Thr Leu Lys 
625 , 630 635 640 

TCG GGC TAC ACG GAG AAG CAG CGC CGG GAC TTC CTG AGC GAA GCC TCC 1968 
Ser Gly Tyr Thr Glu Lys Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser 
645 650 655 



ATC ATG GGC CAG TTC GAC CAT CCC AAC GTC ATC CAC CTG GAG GGT GTC 2016 
He Met Gly Gin Phe Asp His Pro Asn Val lie His Leu Glu Gly Val 
660 665 670 

GTG ACC AAG AGC ACA CCT GTG ATG ATC ATC ACC GAG TTC ATG GAG AAT 2064 
Val Thr Lys Ser Thr Pro Val Met He He Thr Glu Phe Met Glu Asn 
675 ,680 685 

GGC TCC CTG GAC TCC TTT CTC CGG CAA AAC GAT GGG CAG TTC ACA GTC 2112 
Gly Ser Leu Asp Ser Phe Leu Arg Gin Asn Asp Gly Gin Phe Thr Val 
690 695 700 

ATC CAG CTG GTG GGC ATG CTT CGG GGC ATC GCA GCT GGC ATG AAG TAC 2160 
He Gin Leu Val Gly Met Leu lArg Gly He Ala Ala Gly Met Lys Tyr 
705 710 715 ' 720 

CTG GCA GAC ATG AAC TAT GTT CAC CGT GAC CTG GCT GCC CGC AAC ATC 2208 
Leu Ala Asp Met Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 
725 730 735 

CTC GTC AAC AGC AAC CTG GTC TGC AAG GTG TCG GAC TTT GGG CTC TCA 2256 
Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser 
740 745 750 

CGC TTT CTA GAG GAC GAT ACC TCA GAC CCC ACC TAC ACC AGT GCC CTG 2304 
Arg Phe Leu Glu Asp Asp Thr Ser Asp Pro Thr Tyr Thr Ser Ala Leu 
755 760 765 

GGC GGA AAG TTC CCC ATC CGC TGG ACA GCC CCG GAA GCC ATC CAG TAC 2352 
Gly Gly Lys Phe Pro He Arg Trp Thr Ala Pro Glu Ala He Gin Tyr 
770 775 780 

CGG AAG TTC ACC TCG GCC AGT GAT GTG TGG AGC TAC GGC ATT GTC ATG 2400 
Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly He Val Met 
785 790 795 800 

TGG GAG GTG ATG TCC TAT GGG GAG CGG CCC TAC TGG GAC ATG ACC AAC 2448 
Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Thr Asn 
805 810 815 
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FIG. IE 

CAG GAT GTA ATC AAT GCC ATT GAG CAG GAC TAT CGG CTG CCA CCG CCC 2496 
Gin Asp Val lie Asn Ala lie Glu Gin Asp Tyr Arg Leu Pro Pro Pro 
820 825 1 830 

ATG GAC TGC CCG AGC GCC CTG CAC CAA' CTC ATG CTG GAC TGT TGG CAG 2544 
Met Asp Cys Pro Ser Ala Leu His Gin Leu Met Leu Asp Cys Trp Gin 
835 840 845 

i 

AAG GAC CGC AAC CAC CGG CCC AAG TTC GGC CAA ATT GTC AAC ACG CTA 2592 
Lys Asp Arg Asn His Arg Pro Lys Phe Gly Gin He Val Asn Thr Leu 
850 855 860 

GAC AAG ATG ATC CGC AAT CCC AAC AGC CTC AAA GCC ATG GCG CCC CTC 2640 
Asp Lys Met He Arg Asn Pro Asn Ser Leu Lys Ala Met Ala Pro Leu 
865 870 875 , 880 

TCC TCT GGC ATC AAC CTG CCG CTG CTG GAC CGC ACG ATC CCC GAC TAC 2688 
Ser Ser Gly He Asn Leu Pro, Leu Leu Asp Arg Thr He Pro Asp Tyr , 
885 890 895 

ACC AGC TTT AAC ACG GTG GAC GAG TGG CTG GAG GCC ATC AAG ATG GGG 2736 
Thr Ser Phe Asn Thr Val Asp Glu Trp Leu Glu Ala He Lys Met Gly 
900 905 910 

CAG TAC AAG GAG AGC TTC GCC 'AAT GCC GGC TTC ACC TCC TTT GAC GTC 2784 
Gin Tyr Lys Glu Ser Phe Ala Asn Ala Gly Phe Thr Ser Phe Asp Val 
915 920 925 

GTG TCT CAG ATG ATG ATG GAG GAC ATT CTC CGG GTT GGG GTC ACT TTG 2832 
Val Ser Gin Met Met Met Glu Asp He Leu Arg Val Gly Val Thr Leu 
930 935 940 

GCT GGC CAC CAG AAA AAA ATC CTG AAC AGT ATC CAG GTG ATG CGG GCG 2880 
Ala Gly His Gin Lys Lys He Leu Asn Ser He Gin Val Met Arg Ala 
945 950 955 960 

CAG ATG AAC CAG ATT CAG TCT GTG GAG GTT TGACATTCAC CTGCCTCGGC 2930 
Gin Met Asn Gin He Gin Ser Val Glu Val 
965 970 

TCACCTCTTC CTCCAAGCCC CGCCCCCTCT GC 2962 
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FIG. ZA 

CCA GCG TCC CTG GCC GGC TGC TAC TCT GCA CCT CGA CGG GCT CCC CTC 
Pro Ala Ser Leu Ala Gly Cys Tyr Ser Ala Pro Arg Arg Ala Pro Leu 
1 5 10 i«; 



48 



TGG ACG TGC CTT CTC CTG TGC GCC GCA CpC CGG ACC CTC CTG GCC AGC 96 
Trp Thr Cys Leu Le,u Leu Cys Ala Ala Leu Arg Thr. Leu Leu Ala Ser 1 
20 25 30 

CCC AGC AAC GAA GTG AAT TTA TTG GAT TCA CGC ACT GTC ATG GGG GAC 144 
Pro Ser Asn Glu Val Ash Leu Leu Asp Ser Arg Thr Val Met Gly Asd 
35 1 40 , 45 

CTG GGA TGG ATT GCT TTT CCA AAA AAT GGG TGG GAA GAG ATT GGT GAA • 192 
Leu Gly Trp He Ala Phe Pro Lys Asn Gly Trp Glu Glu He Gly Glu 
50 . '55 60 

GTG GAT GAA AAT TAT GCC._CCT ATC CAC ACA TAC CAA GTA,, TGC AAA GTG 240 
Val Asp Glu Asn Tyr Ala Pro He His Thr Tyr Gin Val Cys Lys Val ! 
65 70 7,5 80 

ATG GAA CAG AAT CAG AAT AAC TGG CTT TTG ACC AGT TGG ATC TCC AAT 288 
Met Glu Gin Asn Gin Asn Asn Trp Leu Leu Thr Ser Trp He Ser Asn 

85 90 95 

GAA GGT GCT TCC AGA ATC TTC ATA GAA CTC AAA TTT ACC CTG CGG GAC 336 
Glu Gly Ala Ser Arg He Phe He Glu Leu Lys Phe Thr Leu Arg Asp 
100 105 110 

i 

TGC AAC AGC CTT CCT GGA GGA CTG GGG ACC TGT AAG GAA ACC TTT AAT 384 
Cys Asn Ser Leu Pro Gly Gly Leu Gly Thr Cys Lys Glu Thr Phe Asn 
115 120 125 

ATG TAT TAC TTT GAG TCA GAT GAT CAG AAT GGG AGA AAC ATC AAG GAA 432 
Met Tyr Tyr Phe Glu Ser Asp Asp Gin Asn Gly Arg Asn He Lys Glu 
130 135 140 

AAC CAA TAC ATC AAA ATT GAT ACC ATT GCT GCC GAT GAA AGC TTT ACA 480 
Asn Gin Tyr He Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr 
145 150 155 160 

GAA CTT GAT CTT GGT GAC CGT GTT ATG AAA CTG AAT ACA GAG GTC AGA 528 
Glu Leu Asp Leu Gly Asp Arg Val Met Lys Leu Asn Thr Glu Val Arg 
165 170 175 

GAT GTA GGA CCT CTA AGC AAA AAG GGA TTT TAT CTT GCT TTT CAA GAT 576 
Asp Val Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp 
180 185 190 

GTT GGT GCT TGC ATT GCT CTG GTT TCT GTG CGT GTA TAC TAT AAA AAA 624 
Val Gly Ala Cys He Ala Leu Val Ser Val Arg Val Tyr Tyr Lys Lys 
195 200 205 
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FIG. 2B 

TGC CCT TCT GTG GTA CGA CAC TTG GCT GTC TTC CCT GAC ACC ATC ACT 672 
Cys Pro Ser Val Val Arg His Leu Ala Val Phe Pro Asp Thr lie Thr 
210 215 , , 220 

GGA GCT GAT TCT TCC CAA TTG CTC GAA GTG TCG GGC TCC TGT GTC AAC 720 
Gly Ala Asp Ser Ser Gin Leu Leu Glu Val Ser Gly Ser Cys Val Asn 
225 230 235 240 

i 

CAT TCT GTG ACC GAT GAA CCT CCC AAA ATG iCAC TGC AGC GCC GAA GGG 768 
His Ser Val Thr Asp Glu Pro Pro Lys Met His Cys Ser Ala Glu Gly 
245 250 .. 255 

GAG TGG CTG GTG CCC ATC GGG AAA TGC ATG TGC AAG GCA GGA TAT GAA 816 
Glu Trp Leu Val Pro He Gly Lys Cys Met Cys Lys Ala Gly Tyr Glu 
260 265 270 

GAG AAA AAT GGC ACC TGT CAA GTG TGC AGA CCT GGG TTC TTC AAA GCC 864 
Glu Lys Asn Gly Thr Cys Gin Val Cys Arg Pro Gly Phe Phe Lys Ala 
275 , 280 285 

TCA CCT CAC ATC CAG AGC TGC GGC AAA TGT CCA CCT CAC AGT TAT ACC 912 
Ser Pro His He Gin Ser Cys Gly Lys Cys Pro Pro His Ser Tyr Thr 
290 295 300 

CAT GAG GAA GCT TCA ACC TCT TGT GTC TGT GAA AAG GAT TAT TTC AGG 960 
His Glu Glu Ala Ser Thr Ser Cys Val Cys Glu Lys Asp Tyr, Phe Arg 
305 310 315 320 

AGA GAG TCT GAT CCA CCC ACA ATG GCA TGC ACA AGA CCC CCC TCT GCT 1008 
Arg Glu Ser Asp Pro Pro Thr Met Ala Cys Thr Arg Pro Pro Ser Ala 
325 330 335 

CCT CGG AAT GCC ATC TCA AAT GTT AAT GAA ACT AGT GTC TTT CTG GAA 1056 
Pro Arg Asn Ala He Ser Asn Val Asn Glu Thr Ser Val Phe Leu Glu 
340 345 350 

TGG ATT CCG CCT GCT GAC ACT GGT GGA AGG AAA GAC GTG TCA TAT TAT 1104 
Trp He Pro Pro Ala Asp Thr Gly Gly Arg Lys Asp Val Ser Tyr Tyr 
355 360 365 

ATT GCA TGC AAG AAG TGC AAC TCC CAT GCA GGT GTG TGT GAG GAG TGT 1152 
He Ala Cys Lys Lys Cys Asn Ser His Ala Gly Val Cys Glu Glu Cys 
370 375 380 

GGC GGT CAT GTC AGG TAC CTT CCC CGG CAA AGC GGC CTG AAA AAC ACC 1200 
Gly Gly His Val Arg Tyr Leu Pro Arg Gin Ser Gly Leu Lys Asn Thr 
385 390 395 400 

TCT GTC ATG ATG GTG GAT CTA CTC GCT CAC ACA AAC TAT ACC TTT GAG 1248 
Ser Val Met Met Val Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu 
405 410 415 
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FIG. 2C 

ATT GAG GCA GTG AAT GGA GTG TCC GAC TTG AGC CCA GGA GCC CGG CAG 1296 
lie Glu Ala Val Asn Gly Val Ser Asp Leu Ser Pro Gly Ala Arg Gin 
420 425 430 

TAT GTG TCT GTA AAT GTA ACC ACA AAT CAA GCA GCT CCA TCT CCA GTC 1344 
Tyr Val Ser Val Asn Val Thr Thr Asn Gin Ala Ala Pro Ser Pro Val 
435 440 ' 445 

ACC AAT GTG AAA AAA GGG AAA ATT GCA AAA AAC AGC ATC TCT TTG TCT 1392 
Thr Asn Val Lys Lys Gly Lys He Ala Lys Asn Ser He Ser Leu Ser 
450 '455 , , 460 

TGG CAA GAA~CCA OT~CGT~CCC AAT GGA ATC ATC CTA GAG TAT GAA ATC 1 1440 
Trp Gin Glu Pro Asp Arg Pro Asn Gly He He Leu Glu Tyr Glu He 
465 470 475 480 

AAG CAT TTT GAA AAG GAC CAA GAG ACC AGC TAC ACG ATT ATC AAA TCT ■ 1488 
Lys His Phe Glu Lys Asp Gin Glu Thr Ser Tyr Thr He He Lys Ser 
485 490 ■ 495 

AAA GAG ACA ACT ATT ACT GCA GAG GGC TTG AAA CCA GCT TCA GTT TAT 1536 
Lys Glu Thr Thr He Thr Ala Glu Gly Leu Lys Pro Ala Ser Val Tvr 
500 505 510 

GTC TTC CAA ATT CGA GCA CGT ACA GCA GCA GGC TAT GGT GTC TTC AGT 1584 
Val Phe Gin He Arg Ala Arg Thr Ala Ala Gly Tyr Gly Val Phe Ser 

515 520 525 , 

CGA AGA TTT GAG TTT GAA ACC ACC CCA GTG TTT GCA GCA TCC AGC GAT 1632 
Arg Arg Phe Glu Phe Glu Thr Thr Pro Val Phe Ala Ala Ser Ser Asp 
530 535 540 

CAA AGC CAG ATT CCT GTA ATT GCT GTG TCT GTG ACA GTA GGA GTC ATT 1680 
Gin Ser Gin He Pro Val He Ala Val Ser Val Thr Val Gly Val He 
545 550 555. 560 

TTG TTG GCA GTG GTT ATC GGC GTC CTC CTC AGT GGA AGG CGG TGT GGC 1728 
Leu Leu Ala Val Val He Gly Val Leu Leu Ser Gly Arg Arg Cys Gly 
565 570 575 

TAC AGC AAA GCA AAA CAA GAT CCA GAA GAG GAA AAG ATG CAT TTT CAT 1776 
Tyr Ser Lys Ala Lys Gin Asp Pro Glu Glu Glu Lys Met His Phe His 
580 585 590 

AAT GGG CAC ATT AAA CTG CCA GGA GTA AGA ACT TAC ATT GAT CCA CAT 1824 
Asn Gly His He Lys Leu Pro Gly Val Arg Thr Tyr He Asp Pro His 
595 600 605 

ACC TAT GAG GAT CCC AAT CAA GCT GTC CAC GAA TTT GCC AAG GAG ATA 1872 
Thr Tyr Glu Asp Pro Asn Gin Ala Val His Glu Phe Ala Lys Glu He 
610 615 620 
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FIG. 2D 



GAA GCA TCA TGT ATC ACC ATT GAG AGA GTT- ATT GGA GCA GGT GAA TTT 
Glu Ala Ser Cys lie Thr lie Glu Arg Val He Gly Ala Gly Glu Phe 
625 630 635 640 



1920 



GGT GAA GTT TGT AGT GGA CGT TTG AAA CTA CCA GGA AAA AGA GAA TTA 
Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Lys Arg Glu Leu 
' 645 • 650 1 655 



1968 



CCT GTG GCT ATC AAA ACp CTT AAA GTA GGC TAT ACT GAA AAG CAA CGC 2016 
Pro Val Ala lie Lys Thr, Leu Lys Val Gly Tyr Thr Glu Lys Gin Arg 
660 665 1 670 

AGA GAT TTC CTA 'GGT GAA GCA AGT ATC ATG GGA CAG TTT GAT CAT CCT ' 2064 
Arg Asp Phe Leu Gly Glu Ala Ser He Met Gly Gin Phe Asp His Pro 
675 ' 680 685 

AAC ATC^ATG-GAT-TTA -GAA GGT GTG GTG ACC AAA -AGT -AAA. CCA GTG ATG , 2112 
Asn He He His Leu Glu Gly Val Val Thr Lys Ser Lys Pro Val Met 
690 695 - 700 

ATC GTG ACA GAG TAT ATG GAG AAT GGC TCT TTA GAT ACA TTT TTG AAG 2160 
He Val Thr Glu Tyr Met Glu Asn Gly Ser Leu Asp Thr Phe Leu Lys 
705 710 715 720 

AAA AAC GAT GGG CAG TTC ACT GTG ATT CAG CTT GTT GGC ATG CTG AGA 2208 
Lys Asn Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Arg 

725 730 735 , 

GGT ATC TCT GCA GGA ATG AAG TAC CTT TCT GAC ATG GGC TAT GTG CAT 2256 
Gly He Ser Ala Gly Met Lys Tyr Leu Ser Asp Met Gly Tyr Val His 
740 745 750 

AGA GAT CTT GCT GCC AGA AAC ATC TTA ATC AAC AGT AAC CTT GTG TGC 2304 
Arg Asp Leu Ala Ala Arg Asn He Leu He Asn Ser Asn Leu Val Cys 
755 760 765 

AAA GTG TCT GAC TTT GGA CTT TCC CGG GTA CTG GAA GAT GAT CCC GAG 2352 
Lys Val Ser Asp Phe Gly Leu Ser Arg Val Leu Glu Asp Asp Pro Glu 
770 775 780 

GCA GCC TAC ACC ACA AGG GGA GGA AAA ATT CCA ATC AGA TGG ACT GCC 2400 
Ala Ala Tyr Thr Thr Arg Gly Gly Lys He Pro He Arg Trp Thr Ala 
785 790 795 800 

CCA GAA GCA ATA GCT TTC CGA AAG TTT ACT TCT GCC AGT GAT GTC TGG 2448 
Pro Glu Ala He Ala Phe Arg Lys Phe Thr Ser Ala Ser Asp Val Trp 
805 810 815 

AGT TAT GGA ATA GTA ATG TGG GAA GTT GTG TCT TAT GGA GAG AGA CCC 2496 
Ser Tyr Gly He Val Met Trp Glu Val Val Ser Tyr Gly Glu Arg Pro 
820 825 830 
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TAC TGG GAG ATG ACC AAT CAA GAT GTG ATT AAA GCG GTA GAG GAA GGC 2544 
Tyr Trp Glu Met Thr Asn Gin Asp Val He Lys Ala Val Glu Glu Gly 
835 840 845 

TAT CGT CTG CCA AGC CCC ATG GAT TGT' CCT GCT GCT CTC TAT CAG TTA 2592 
Tyr Arg Leu Pro Ser Pro Met Asp Cys Pro Ala Ala Leu Tyr Gin Leu 
850 855 860 

ATG CTG GAT TGC TGG CAG AAA GAG CGA AAT AGC AGG CCC AAG TTT GAT 2640 
Met Leu Asp Cys Trp Gin Lys Glu Arg Asn Ser Arg Pro Lys Phe Asp 
865 870 875 880 



GAA ATA GTC AAC ATG TTG GAC AAG CTG ATA CGT AAC CCA AGT AGT CTG 2688 
Glu He Val Asn Met Leu Asp Lys Leu He Arg Asn Pro Ser Ser Leu 
885 890 895 

AAG ACG CTG GTT AAT GCA TCC TGC AGA GTA TCT AAT TTA TTG GCA GAA 2736 
Lys Thr Leu Val Asn Ala Ser Cys Arg Val Ser Asn Leu Leu Ala Glu 
900 905 910 

CAT AGC CCA CTA GGA TCT GGG GCC TAC AGA TCA GTA GGT GAA TGG CTA 2784 
His Ser Pro Leu Gly Ser Gly Ala Tyr Arg Ser Val Gly Glu Trp Leu 
915 920 925 

GAG GCA ATC AAG ATG GGC CGG TAT ACA GAG ATT TTC ATG GAA AAT GGA 2832 
Glu Ala He Lys Met Gly Arg Tyr Thr Glu He Phe Met Glu Asn Gly 
930 935 1 940 

TAC AGT TCA ATG GAC GCT GTG GCT CAG GTG ACC TTG GAG GAT TTG AGA 2880 
Tyr Ser Ser Met Asp Ala Val Ala Gin Val Thr Leu Glu Asp Leu Arg 
945 950 955 960 

CGG CTT GGA GTG ACT CTT GTC GGT CAC CAG AAG AAG ATC ATG AAC AGC 2928 
Arg Leu Gly Val Thr Leu Val Gly His Gin Lys Lys He Met Asn Ser 
965 970 975 

CTT CAA GAA ATG AAG GTG CAG CTG GTA AAC GGA ATG GTG CCA TTG TAACTTCATG 
2983 

Leu Gin Glu Met Lys Val Gin Leu Val Asn Gly Met Val Pro Leu 
980 985 990 

TAAATGTCGC TTCTTCAAGT GAATGATTCT GCACTTTGTA AACAGCACTG AGATTTATTT 3043 

TAACAAAAAA AGGGGGAAAA GGGAAAACAG TGATTTCTAA ACCTTAGAAA ACATTTGCCT 3103 

CAGCCACAGA ATTTGTAATC ATGGTTTTAC TGAAGTATCC AGTTCTTAGT CCTTAGTCT 3162 
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FIG. 3A 

AAGCGGCAGG AGCAGCGTTG GCACCGGCGA ACC ATG GCT, GGG ATT TTC TAT TTC 54 

Met Ala Gly He Phe Tyr Phe 
1 5 

GCC CTA TTT TCG TGT CTC TTC GGG ATT TGC GAC. GCT GTC ACA GGT TCC 102 
Ala Leu Phe Ser Cys Leu Phe Gly He Cys Asp Ala Val Thr Gly Ser 
10 15 , 20 

AGG GTA TAC CCC GCG AAT GAA GTT ACC TTA TTG GAT TCC AGA TCT GTT 150 
Arg Val Tyr Pro Ala Asn Glu Val Thr Leu Leu Asp Ser Arg Ser Val 
25 30 35 

CAG GGA GAA CTT GGG TGG ATA GCA AGC CCT CTG GAA GGA GGG TGG GAG 198 
Gin Gly Glu Leu Gly Trp He Ala Ser Pro Leu Glu Gly Gly Trp Glu 

._40. 45 50 55 



GAA GTG AGT ATC ATG GAT GAA AAA AAT ACA CCA ATC CGA ACC TAC CAA 246 
Glu Val Ser He Met Asp Glu Lys Asn Thr Pro He Arg Thr Tyr Gin ' 

60 65 70 

GTG TGC AAT GTG ATG GAA CCC AGC CAG AAT AAC TGG CTA CGA ACT GAT 294 
Val Cys Asn Val Met Glu Pro Ser Gin Asn Asn Trp Leu Arg Thr Asp 
75 80 85 

i 

TGG ATC ACC CGA GAA GGG GCT CAG AGG GTG TAT ATT GAG ATT AAA TTC 342 
Trp He Thr Arg Glu Gly Ala Gin Arg Val Tyr He Glu He Lys Phe 
90 95 100 

ACC TTG AGG GAC TGC AAT AGT CTT CCG GGC GTC ATG GGG ACT TGC AAG 390 
Thr Leu Arg Asp Cys Asn Ser Leu Pro Gly Val Met Gly Thr Cys Lys 
105 110 115 

GAG ACG TTT AAC CTG TAC TAC TAT GAA TCA GAC AAC GAC AAA GAG CGT 438 
Glu Thr Phe Asn Leu Tyr Tyr Tyr Glu Ser Asp Asn Asp Lys Glu Arg 
120 125 130 135 

TTC ATC AGA GAG AAC CAG TTT GTC AAA ATT GAC ACC ATT GCT GCT GAT 486 
Phe He Arg Glu Asn Gin Phe Val Lys He Asp Thr He Ala Ala Asp 
140 145 150 

GAG AGC TTC ACC CAA GTG GAC ATT GGT GAC AGA ATC ATG AAG CTG AAC 534 
Glu Ser Phe Thr Gin Val Asp He Gly Asp Arg He Met Lys Leu Asn 
155 160 165 

ACC GAG ATC CGG GAT GTA GGG CCA TTA AGC AAA AAG GGG TTT TAC CTG 582 
Thr Glu He Arg Asp Val Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu 
170 175 180 
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FIG. 3B . 

GCT TTT CAG GAT GTG GGG GCC TGC ATC GCC CTG GTA TCA GTC CGT CTC «n 
Ala Phe Gin Asp Val Gly Ala Cys lie Ala Leu Val Ser vS. Sg 3 ° 
185 . 190 195 

TTC TAT AAA AAG TGT CCA CTC ACA GTC CGC AAT CTG GCC CAG TTT CCT fi7A 
Phe Tyr Lys Lys Cys Pro Leu Thr Val Arg Asn Leu Ala Gin Phe Pro 
200 . i -205 210 ' 215 

GAC ACC ATC ACA GGG GCT GAT ACG TCT TCC CTG GTG GAA GTT CGA GGC 79fi 
Asp Thr lie Thr Gly A*a Asp Thr Ser Ser Leu Val Glu Val Arg Sv 
220 , 225 230 

TCC TGT GTC AAC AAC TCA GAA GAG AAA GAT GTG CCA AAA ATG TAC TGT 774 
Ser Cys Val Asn Asn Ser Glu Glu Lys Asp Val Pro Lys Met Tyr Cys 
235 240 245 

GGG GCA GAT GGT GAA TGG CTG GTA CCC ATT GGC AAC TGC CTA TGC AAC 
Gly Ala Asp Gly Glu -Trp Leu Val Pro He Gly Asn Cys Leu Cys Asn 
250 255 260 

GCT GGG CAT GAG GAG CGG AGC GGA GAA TGC CAA GCT TGC AAA ATT GGA 870 
Ala Gly His Glu Glu Arg Ser Gly Glu Cys Gin Ala Cys Lys He Glv 
265 270 275 

TAT TAC AAG GCT CTC TCC ACG GAT GCC ACC TGT GCC AAG TGC CCA CCC 918 
Tyr Tyr Lys Ala Leu Ser Thr Asp Ala Thr Cys Ala Lys Cys Pro Pro 
280 285 290 295 

CAC AGC TAC TCT GTC TGG GAA GGA GCC ACC TCG TGC ACC TGT GAC CGA ' 966 
His Ser Tyr Ser Val Trp Glu Gly Ala Thr Ser Cys Thr Cys Asp Arq 
300 305 310 

GGC TTT TTC AGA GCT GAC AAC GAT GCT GCC TCT ATG CCC TGC ACC CGT 1014 
Gly Phe Phe Arg Ala Asp Asn Asp Ala Ala Ser Met Pro Cys Thr Ara 
315 320 325 

CCA CCA TCT GCT CCC CTG AAC TTG ATT TCA AAT GTC AAC GAG ACA TCT 1062 
Pro Pro Ser Ala Pro Leu Asn Leu He Ser Asn Val Asn Glu Thr Ser 
330 335 340 

GTG AAC TTG GAA TGG AGT AGC CCT CAG AAT ACA GGT GGC CGC CAG GAC 1110 
Val Asn Leu Glu Trp Ser Ser Pro Gin Asn Thr Gly Gly Arg Gin Asd 
345 350 355 

ATT TCC TAT AAT GTG GTA TGC AAG AAA TGT GGA GCT GGT GAC CCC AGC 1158 
He Ser Tyr Asn Val Val Cys Lys Lys Cys Gly Ala Gly Asp Pro Ser 
360 365 370 375 

AAG TGC CGA CCC TGT GGA AGT GGG GTC CAC TAC ACC CCA CAG CAG AAT 1206 
Lys Cys Arg Pro Cys Gly Ser Gly Val His Tyr Thr Pro Gin Gin Asn 
380 385 390 
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FIG. 5C 

GGC TTG AAG ACC ACC AAA GTC TCC ATC ACT GAC CTC CTA GCT CAT ACC 1254 
Gly Leu Lys Thr Thr Lys Val Ser He Thr Asp Leu Leu Ala His Thr 
395 400 405 

AAT TAC ACC TTT GAA ATC TGG GCT GTG AAT GGA GTG TCC AAA TAT AAC 1302 
Asn Tyr Thr Phe GJ.u He Trp Ala Val Asn Gly Val. Ser Lys Tyr Asn 
410 415 i 420 

CCT AAC CCA GAC CAA TCA GTT TCT GTC ACT GTG ACC ACC AAC CAA GCA 1350 
Pro Asn Pro Asp Gin Ser Val Ser Val Thr Val Thr Thr Asn Gin Ala 
425 ' 430 435 

GCA CCA TCA TCC 'ATT GCT TTG GTC CAG GCT AAA. GAA GTC ACA AGA TAC • 1398 
Ala Pro Ser Ser He Ala Leu Val Gin Ala Lys Glu Val Thr Arg Tyr 
440 445 450 455 

_AGT _GTG_GCA ,CTG GCT TGG CTG GAA CCA GA T CG G CCC AAT, GGG GTA ATC 1446 
Ser Val Ala Leu Ala Trp Leu Glu Pro Asp Arg Pro Asn Gly Val He 1 
460 465 , 470 

CTG GAA TAT GAA GTC AAG TAT TAT GAG AAG GAT CAG AAT GAG CGA AGC 1494 
Leu Glu Tyr Glu Val Lys Tyr Tyr Glu Lys Asp Gin Asn Glu Arg Ser 
475 480 485 

TAT CGT ATA GTT CGG ACA GCT GCC AGG AAC ACA GAT ATC AAA GGC CTG 1542 
Tyr Arg He Val Arg Thr Ala Ala Arg Asn Thr Asp He Lys Gly Leu 
490 495 500 

I 

AAC CCT CTC ACT TCC TAT GTT TTC CAC GTG CGA GCC AGG ACA GCA GCT 1590 
Asn Pro Leu Thr Ser Tyr Val Phe His Val Arg Ala Arg Thr Ala Ala 
505 510 515 

GGC TAT GGA GAC TTC AGT GAG CCC TTG GAG GTT ACA ACC AAC ACA GTG 1638 
Gly Tyr Gly Asp Phe Ser Glu Pro Leu Glu Val Thr Thr Asn Thr Val 
520 525 530 535 

CCT TCC CGG ATC ATT GGA GAT GGG GCT AAC TCC ACA GTC CTT CTG GTC 1686 
Pro Ser Arg He He Gly Asp Gly Ala Asn Ser Thr Val Leu Leu Val 
540 545 550 

TCT GTC TCG GGC AGT GTG GTG CTG GTG GTA ATT CTC ATT GCA GCT TTT 1734 
Ser Val Ser Gly Ser Val Val Leu Val Val He Leu He Ala Ala Phe 
555 560 565 

GTC ATC AGC CGG AGA CGG AGT AAA TAC AGT AAA GCC AAA CAA GAA GCG 1782 
Val He Ser Arg Arg Arg Ser Lys Tyr Ser Lys Ala Lys Gin Glu Ala 
570 575 580 

GAT GAA GAG AAA CAT TTG AAT CAA GGT GTA AGA ACA TAT GTG GAC CCC 1830 
Asp Glu Glu Lys His Leu Asn Gin Gly Val Arg Thr Tyr Val Asp Pro 
585 590 595 
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TTT ACG TAC GAA GAT CCC AAC CAA GCA GTG CGA GAG TTT GCC AAA GAA 1878 
Phe Thr Tyr Glu Asp Pro Asn Gin Ala Val Arg Glu Phe Ala Lys Glu 
600 605 610 615 

t 

t 

ATT GAC GCA TCC TGC ATT AAG ATT GAA AAA GTT ATA GGA GTT GGT GAA 1926 
lie Asp Ala Ser Cys He Lys He Glu Lys Val He Gly Val Gly Glu 
620. 625 630 

TTT GGT GAG GTA TGC AGT GGG CGT CTC AAA GTG CCT GGC AAG AGA GAG 1974 
Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Val Pro Gly Lys Arg Glu 
635 ' 640 645 

ATC_TGT GTG.-.GCT ATC AAG ACT CTG AAA GCT GGT TAT ACA GAC AAA CAG 2022 
He Cys Val Ala He Lys Thr Leu Lys Ala Gly Tyr Thr Asp Lys Gin 
650 655 660 

AGG AGA GAC TTC CTG AGT GAG GCC AGC ATC ATG GGA CAG TTT GAC CAT 2070 
Arg-Arg Asp Phe -Leu Ser -Glu- Ala Ser He Met Gly Gin Phe Asp His 
665 670 675 

CCG AAC ATC ATT CAC TTG GAA GGC GTG GTC ACT AAA TGT AAA CCA GTA 2118 
Pro Asn lie He His Leu Glu Gly Val Val Thr Lys Cys Lys Pro Val 
680 685 690 695 

ATG ATC ATA ACA GAG TAC ATG GAG AAT GGC TCC TTG GAT GCA TTC CTC 2166 
Met He He Thr Glu Tyr Met Glu Asn Gly Ser Leu Asp Ala Phe Leu 
700 1 705 710 

AGG AAA AAT GAT GGC AGA TTT ACA GTC ATT CAG CTG GTG GGC ATG CTT 2214 
Arg Lys Asn Asp Gly Arg Phe Thr Val He Gin Leu Val Gly Met Leu 
715 720 725 

CGT GGC ATT GGG TCT GGG ATG AAG TAT TTA TCT GAT ATG AGC TAT GTG 2262 
Arg Gly He Gly Ser Gly Met Lys Tyr Leu Ser Asp Met Ser Tyr Val 
730 735 740 

CAT CGT GAT CTG GCC GCA CGG AAC ATC CTG GTG AAC AGC AAC TTG GTC 2310 
His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val 
745 750 755 

TGC AAA GTG TCT GAT TTT GGC ATG TCC CGA GTG CTT GAG GAT GAT CCG 2358 
Cys Lys Val Ser Asp Phe Gly Met Ser Arg Val Leu Glu Asp Asp Pro 
760 765 770 775 

GAA GCA GCT TAC ACC ACC AGG GGT GGC AAG ATT CCT ATC CGG TGG ACT 2406 
Glu Ala Ala Tyr Thr Thr Arg Gly Gly Lys He Pro He Arg Trp Thr 
789 785 790 

GCG CCA GAA GCA ATT GCC TAT CGT AAA TTC ACA TCA GCA AGT GAT GTA 2454 
Ala Pro Glu Ala He Ala Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val 
795 800 805 
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FIG. 3E 

' TGG AGC TAT GGA ATC GTT ATG TGG GAA GTG ATG TCG TAC GGG GAG AGG 2502 
Trp Ser Tyr Gly He Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg 
810 815 820 

CCC TAT TGG GAT ATG TCC AAT CAA GAT G^G ATT AAA GCC ATT GAG GAA 2550 
Pro Tyr Trp Asp M$t Ser Asn Gin Asp Val He Lys 'Ala He Glu Glu 
825 830 « 835 



GGC TAT CGG TTA CCC CCT CCA ATG GAC TGC CCC ATT GCG CTC CAC CAG 2598 

Gly Tyr Arg Leu Pro Pro' Pro Met Asp Cys Pro He Ala Leu His Gin 

840 845 850 855 

i 

CTG ATG CTA GAC TGC TGG CAG AAG GAG AGG AGC GAC AGG CCT AAA TTT 2646 

Leu Met Leu Asp Cys Trp- Gin Lys Glu Arg Ser Asp Arg Pro Lys Phe 
860 ' 865 870 

GGG CAG ATT GTC AAC ATG TTG GAC AAA CTC ATC CGC AAC^ CCC AAC AGC ' 2694 
Gly Gin lie Val Asn Met Leu Asp Lys Leu lie Arg Asn Pro Asn Ser 

875 880 885 

i 

TTG AAG AGG ACA GGG ACG GAG AGC TCC AGA CCT AAC ACT GCC TTG TTG 2742 
Leu Lys Arg Thr Gly Thr Glu Ser Ser Arg Pro Asn Thr Ala Leu Leu 
890 895 900 

i 

GAT CCA AGC TCC CCT GAA TTC TCT GCT GTG GTA TCA GTG GGC GAT TGG 2790 
Asp Pro Ser Ser Pro Glu Phe Ser Ala Val Val Ser Val Gly Asp Trp 

905 910 915 1 

CTC CAG GCC ATT AAA ATG GAC CGG TAT AAG GAT AAC TTC ACA GCT GCT 2838 
Leu Gin Ala lie Lys Met Asp Arg Tyr Lys Asp Asn Phe Thr Ala Ala 
920 925 930 935 

GGT TAT ACC ACA CTA GAG GCT GTG GTG CAC GTG AAC CAG GAG GAC CTG 2886 
Gly Tyr Thr Thr Leu Glu Ala Val Val His Val Asn Gin Glu Asp Leu 
940 945 950 

GCA AGA ATT GGT ATC ACA GCC ATC ACG CAC CAG AAT AAG ATT TTG AGC 2934 
Ala Arg lie Gly lie Thr Ala He Thr His Gin Asn Lys He Leu Ser 
955 960 965 

AGT GTC CAG GCA ATG CGA ACC CAA ATG CAG CAG ATG CAC GGC AGA ATG 2982 
Ser Val Gin Ala Met Arg Thr Gin Met Gin Gin Met His Gly Arg Met 
970 975 980 

GTT CCC GTC TGAGCCAGTA CTGAATAAAC TCAAAACTCT TGAAATTAGT 3031 
Val Pro Val 
985 

TTACCTCATC CATGCACTTT AATTGAAGAA CTGCACTTTT TTTACTTCGT CTTCGCCCTC 3091 

TGAAATTAAA GAAATGAAAA AAAAA 3116 
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FIG. 4A 

CGGTGCGAGC GAACAG6AGT GGGGGGGAAA TTAAAAAAAG CTAAACGTGG AGCAGCCGAT 60 

CGGGGACCGA GAAGGG6AAT CGAT6CAAG6 AGCACACTAA AACAAAAGCT ACTTCGGAAC 120 

AAACAGCATT TAAAAATCCA CGACTCAAGA TAACTGAAAC CTAAAATAAA ACCTGCTCAT 180 

GCACC ATG GTT TTT CAA ACT CGG TAC CCT TCA TGG ATT ATT TTA TGC 227 
Met Val Phe Gin Thr Arg Tyr Pro Ser Trp He He Leu Cys 
1 5 ,10 

i 

TAC ATC TGG CTG CTC CGC TTT GCA CAC ACA GGG GAG GCG CAG GCT GCG 275 
Tyr He Trp Leu Leu Arg Phe Ala His Thr Gly Glu Ala Gin Ala Ala 
15 20 25 30 - 

AAG GAA GTA CTA CTG CTG GAT TCT AAA GCA CAA CAA ACA GAG TTG GAG 323 
Lys Glu Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu 

35 40 45 

TGG ATT TCC TCT CCA CCC AAT ( GGG TGG GAA GAA ATT AGT GGT TTG GAT 371 
Trp He Ser Ser Pro Pro Asn Gly Trp Glu Glu He Ser Gly Leu Asp ' 
50 55 60 

GAG AAC TAT ACC CCG ATA CGA ACA TAC CAG GTG TGC CAA GTC ATG GAG 419 
Glu Asn Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu 
65 70 75 

i 

CCC AAC CAA AAC AAC TGG CTG CGG ACT AAC TGG ATT TCC AAA GGC AAT 467 
Pro Asn Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn 
80 85 90 

GCA CAA AGG ATT TTT GTA GAA TTG AAA TTC ACC CTG AGG GAT TGT AAC 515 
Ala Gin Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn 
95 100 105 no 

AGT CTT CCT GGA GTA CTG GGA ACT TGC AAG GAA ACA TTT AAT TTG TAC 563 
Ser Leu Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr 
115 120 125 

TAT TAT GAA ACA GAC TAT GAC ACT GGC AGG AAT ATA AGA GAA AAC CTC 611 
Tyr Tyr Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu 
130 135 140 

TAT GTA AAA ATA GAC ACC ATT GCT GCA GAT GAA AGT TTT ACC CAA GGT 659 
Tyr Val Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly 
145 150 155 

GAC CTT GGT GAA AGA AAG. ATG AAG CTT AAC ACT GAG GTG AGA GAG ATT 707 
Asp Leu Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu He 
160 165 170 
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FIG. 4B 

GGA CCT TTG TCC AAA AAG GGA TTC TAT CTT GCC TTT CAG GAT GTA GGG 755 
Gly Pro Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Glv 
175 180 1 185 ' 190 

GCT TGC ATA GCT TTG GTT TCT GTC AAA GTG TAC TAC AAG AAG TGC TGG 803 
Ala Cys lie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Tip 
. 195 200 205 

TCC ATT ATT GAG AAC TTA GCT ATC TTT CCA GAT ACA GTG ACT GGT TCA 851 
Ser lie lie Glu Asn Leu Ala lie Phe Pro Asp Thr Val Thr Gly Ser 
210 215 220 

GAA TTT TCC TCT TTA GTC GAG GTT CGA GGG ACA TGT GTC AGC AGT GCA 899 
Glu Phe Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala 
225 230 235 

GAG GAA GAA GCG GAA AAC GCC CCC AGG ATG CAC TGC AGT GCA GAA GGA 947 
Glu Glu Glu Ala Glu Asn Ala Pro Arg Met His Cys Ser Ala Glu Gly 
240 245' 250 

GAA TGG TTA GTG CCC ATT GGA AAA TGT ATC TGC AAA GCA GGC TAC CAG 995 
Glu Trp Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin 
255 260 265 270 

CAA AAA GGA GAC ACT TGT GAA ,CCC TGT GGC CGT GGG TTC TAC AAG TCT 1043 
Gin Lys Gly Asp Thr Cys Glu Pro Cys Gly Arg Gly Phe Tyr Lys Ser 
275 280 285 

TCC TCT CAA GAT CTT CAG TGC TCT CGT TGT CCA ACT CAC AGT TTT TCT 1091 
Ser Ser Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser 
290 295 300 

GAT AAA GAA GGC TCC TCC AGA TGT GAA TGT GAA GAT GGG TAT TAC AGG 1139 
Asp Lys Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg 
305 310 315 

GCT CCA TCT GAC CCA CCA TAC GTT GCA TGC ACA AGG CCT CCA TCT GCA 1187 
Ala Pro Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala 
320 325 330 

CCA CAG AAC CTC ATT TTC AAC ATC AAC CAA ACC ACA GTA AGT TTG GAA 1235 
Pro Gin Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu 
335 340 345 350 

TGG AGT CCT CCT GCA GAC AAT GGG GGA AGA AAC GAT GTG ACC TAC AGA 1283 
Trp Ser Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg 
355 360 365 

ATA TTG TGT AAG CGG TGC AGT TGG GAG CAG GGC GAA TGT GTT CCC TGT 1331 
He Leu Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys 
370 375 380 
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FIG. 4C 

■SS Ser JES ?r ^ CCC CAG CAG ' ACT ^ TOA ^ AAC 1379 

Gly Ser Asn lie Gly Tyr Met Pro Gin Gin. Thr Gly Leu Glu Asp Asn 
385 390 395 



TAT GTC ACT GTC ATG GAC CTG CTA GCC CAC GCT AAT TAT ACT TTT GAA* 
Tyr Va Thr Val Met Asp Leu Leu Ala His Ala Asn ™ J£ 
* uu 405 410 

GTT GAA GCT GTA AAT GGA GTT TCT GAC TTA AGC CGA TCC CAG AGG CTC 
Val Glu Ala Val Asn Gly ,Val Ser Asp Leu Ser Arg Ser Gin Arg Leu 
qib 420 ' 425 430 

TTT GCT GCT GTC AGT ATC ACC ACT GGT CAA GCA GCT CCC TCG CAA GTG 
Phe Ala Ala Val Ser lle.Thr Thr Gly Gin Ala Ala Pro Ter X Va! 

435 , 440 445 

AGC GGA GTA ATG AAG GAG AGA GTA CTG CAG CGG AGT GTC GAG CTT TCC 
Ser Gly Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Glu ^ 
450 455 ' 4 6 o 



1427 



1475 



1523 



1571 



1619 



1715 



TGG CAG GAA CCA GAG CAT CCC AAT GGA GTC ATC ACA GAA TAT GAA ATC 
Trp Gin Glu Pro Glu His Pro Asn Gly Val lie Thr Glu Tyr gJu" iTe 
465 470 475 

AAG TAT TAC GAG AAA GAT CAA, AGG GAA CGG AC? TAC TCA ACA GTA AAA 1667 
Lys Tyr Tyr Glu Lys Asp Gin Arg Glu Arg Tht Tyr Ser Thr Val Lys 

480 485 490 i 

ACC AAG TCT ACT TCA GCC TCC ATT AAT AAT CTG AAA CCA GGA ACA GTG 
Thr Lys Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val 
95 500 505 510 

TAT GTT TTC CAG ATT CGG GCT TTT ACT GCT GCT GGT TAT GGA AAT TAC 17^ 
Tyr Val Phe Gin lie Arg Ala Phe Thr Ala Ala Gly Tyr Gly J£ 2£ 1? " 
515 520 525 

AGT CCC AGA CTT GAT GTT GCT ACA CTA GAG GAA GCT ACA GGT AAA ATG 1811 
Ser Pro Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Thr Gly Lys Met 
530 535 540 

TTT GAA GCT ACA GCT GTC TCC AGT GAA CAG AAT CCT GTT ATT ATC ATT 1859 
Phe Glu Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He 
545 550 555 

GCT GTG GTT GCT GTA GCT GGG ACC ATC ATT TTG GTG TTC ATG GTC TTT 1907 
Ala Val Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe 
560 565 570 

GGC TTC ATC ATT GGG AGA AGG CAC TGT GGT TAT AGC AAA GCT GAC CAA 1955 
Gly Phe He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin 
575 580 585 cqn 
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19/33 FIG. 4D 

GAA GGC GAT GAA GAG CTT TAC TTT CAT TTT' AAA TTT' CCA GGC ACC AAA 
Glu Gly Asp Glu Glu Leu Tyr Phe His Phe. Lys Phe Pro Gly Thr Lys 
595 600 605 



2003 



ACC TAC ATT GAC CCT GAA ACC TAT GAG GAC CCA AAT AGA GCT GTC CAT 
Thr Tyr lie Asp Pro Glu Thr Tyr Glu Asp Pro Asn Arg Ala Val His 1 
6i0 1 615 , ' 620 



2051 



CAA TTC GCC AAG GAG CTA GAT GCC TCC TGT ATT AAA ATT GAG CGT GTG 2099 
Gin Phe Ala Lys Glu Leu i Asp Ala Ser Cys lie Lys He Glu Arg Val 
625 630 ' 635 

ATT GGT GCA GGA GAA TTC GGT GAA GTC TGC AGT GGC CGT TTG AAA CTT 2147_ 
He Gly Ala Gly Glu Phe, Gly Glu Val Cys Ser Gly Arg Leu Lys Leu 
640 645 . 650 

CCA GGG AAA AGA GAT GTT GCA GTA GCC ATA AAA ACC CTG* AAA GTT GGT 1 2195 
Pro Gly Lys Arg Asp Val Ala Val Ala He Lys Thr Leu Lys Val Gly 
655 660 665 670 

i 

TAC ACA GAA AAA CAA AGG AGA GAC TTT TTG TGT GAA GCA AGC ATC ATG 2243 
Tyr Thr Glu Lys Gin Arg Arg Asp Phe Leu Cys Glu Ala Ser He Met 
675 680 685 

GGG CAG TTT GAC CAC CCA AAT GTT GTC CAT TTG GAA GGG GTT GTT ACA 2291 
Gly Gin Phe Asp His Pro Asn Val Val His Leu Glu Gly Val Val Thr 

690 695 700 ' 

AGA GGG AAA CCA GTC ATG ATA GTA ATA GAG TTC ATG GAA AAT GGA GCC 2339 
Arg Gly Lys Pro Val Met He Val He Glu Phe Met Glu Asn Gly Ala 
705 710 715 

CTA GAT GCA TTT CTC AGG AAA CAT GAT GGG CAA TTT ACA GTC ATT CAG 2387 
Leu Asp Ala Phe Leu Arg Lys His Asp Gly Gin Phe Thr Val He Gin 
720 725 730 

TTA GTA GGA ATG CTG AGA GGA ATT GCT GCT GGA ATG AGA TAT TTG GCT 2435 
Leu Val Gly Met Leu Arg Gly He Ala Ala Gly Met Arg Tyr Leu Ala 
735 740 745 750 

GAT ATG GGA TAT GTT CAC AGG GAC CTT GCA GCT CGC AAT ATT CTT GTC 2483 
Asp Met Gly Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val 
755 760 765 

AAC AGC AAT CTC GTT TGT AAA GTG TCA GAT TTT GGC CTG TCC CGA GTT 2531 
Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Val 
770 775 780 

ATA GAG GAT GAT CCA GAA GCT GTC TAT ACA ACT ACT GGT GGA AAA ATT 2579 
He Glu Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly Gly Lys He 
785 790 795 
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20/ 33 FIG.4E 

CCA GTA AGG TGG ACA GCA CCC GAA GCC ATC CAG TAC CGG AAA TTC ACA 
Pro Val Arg Trp Thr Ala Pro Glu Ala lie Gin Tyr Arg Lys Phe Thr 
800 805 810 
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TCA GCC AGT GAT GTA TGG AGC TAT GGA ATA GTC ATG TGG GAA GTT ATG 
Ser Ala Ser Asp Val .Trp Ser Tyr Gly He Val Met Trp Glu Val Met 
815 , 820 825 830 
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TCT TAT GGA GAA AGA CCT TAT TGG GAC ATG TCA AAT CAA GAT GTT ATA 
Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val He 
835 ' 840 ' 845 
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AAA GCA ATA GAA GAA GGT TAT CGT TTA CCA GCA CCC ATG GAC TGC CCA 2771 
Lys Ala He Glu Glu Gly Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro 
850 855 86,0 

GCT GGC CTT CAC CAG CTA ATG TTG GAT TGT TGG CAA AAG GAG CGT GCT 2819 
Ala Gly Leu His Gin Leu Met Leu Asp Cys Trp Gin Lys Glu Arg Ala 
865 , 870 875 

GAA AGG CCA AAA TTT GAA CAG ATA GTT GGA ATT CTA GAC AAA ATG ATT 2867 
Glu Arg Pro Lys Phe Glu Gin He Val Gly He Leu Asp Lys Met He 
880 885 890 

CGA AAC CCA AAT AGT CTG AAA ACT CCC CTG GGA ACT TGT AGT AGG CCA 2915 
Arg Asn Pro Asn Ser Leu Lys* Thr Pro Leu Gly Thr Cys Ser Arg Pro 
895 900 905 ' 910 

ATA AGC CCT CTT CTG GAT CAA AAC ACT CCT GAT TTC ACT ACC TTT TGT 2963 
He Ser Pro Leu Leu Asp Gin Asn Thr Pro Asp Phe Thr Thr Phe Cys 
915 920 925 

TCA GTT GGA GAA TGG CTA CAA GCT ATT AAG ATG GAA AGA TAT AAA GAT 3011 
Ser Val Gly Glu Trp Leu Gin Ala He Lys Met Glu Arg Tyr Lys Asp 
930 935 940 

AAT TTC ACG GCA GCT GGC TAC AAT TCC CTT GAA TCA GTA GCC AGG ATG 3059 
Asn Phe Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met 
945 950 955 

ACT ATT GAG GAT GTG ATG AGT TTA GGG ATC ACA CTG GTT GGT CAT CAA . 3107 
Thr He Glu Asp Val Met Ser Leu Gly He Thr Leu Val Gly His Gin 
960 965 970 

AAG AAA ATC ATG AGC AGC ATT CAG ACT ATG AGA GCA CAA ATG CTA CAT 3155 
Lys Lys He Met Ser Ser He Gin Thr Met Arg Ala Gin Met Leu His 
975 980 985 990 

TTA CAT GGA ACT GGC ATT CAA GTG TGATATGCAT TTCTCCCTTT TAAGGGAGAT 3209 
Leu His Gly Thr Gly He Gin Val 
995 
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TACAGACTGC AAGAGAACAG 


i i \j. ""f r 

TACTGGCCTT CAGTATATGC 


ATAGAATGCT 


GCTAGAAGAC 


3269 


AAGTGATGTC CTGGGTCCTT 


CCAACAGTGA AGAGAAGATT 


TAAGAAGCAC 


CTATAGACTT 


3329 


GAACTCCTAA GTGCCACCAG 


AATATATAAA AAGGGAATTT 


AGGATCCACC 


ATCGGTGGCC 


3389 


AGGAAAATAG CAGTGACAAT 


AAACAAAGTA CTACCTGAAA 


AACATCCAAA 


CACCTTGAGC 


3449 


TCTCTAACCT CCTTTTTGTC 


TTATAGACTT TTTAAAATGT 


ACATAAAGAA 


TTTAAGAAAG 


3509 


AATATATTTG TCAAATAAAA 


TCATGATCTT ATTGTTAAAA 


TTAATGAAAT 


ATTTTCCTTA 


3569 


AATATGTGAT TTCAGACTAT 


TCCTTTTTAA AATCATTTGT 


GTTTATTCTT 


CATAAGGACT 


3629 


TTGTTTTAGA AAGCTGTTTA 


TAGCTTTGGA CCTTTTTAGT 


GTTAAATCTG 


'TAACATTACT 


3689 


ACACTGGGTA CCTTTGAAAG 


AATCTCAAAT TTCAAAAGAA 


ATAGCATGAT 


TGAAGATACA 


3749 


TCTCTGTTAG AACATTGGTA 


TCCTTTTTGT GCCATTTTAT 


TCTGTTTAAT 


CAGTGCTGTT 


3809 
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GCAGGTAGTC AAGAAAATGC 


AAGTTGCCAA 


GAGCTCTGAT 


3869 


ATTTTTTAAA AAGAATTTTT 


TTGTAAAGAT CAGACAACAC 


ACTATCTTTT 


CAATGAAAAA 


3929 


AGCAATAATG ATCCATACAT 


ACTATAAGGC ACTTTTAACA 


GATTGTTTAT AGAGTGATTT 


3989 


TACTAGAAAG AATTTAATAA 


ACTCGAAGTT TAGGTTTATG 


AGTATATAAA 


CAAATGAGGC 


4049 


ACTTCATCTG AAGAATGTTG 


GTGAAGGCAA GTCTCTGAAA 


GCAGAACTAT 


CCAGTGTTAT 


4109 


CTAAAAATTA ATCTGAGCAC 


ATCAAGATTT TTTCATTCTC 


GTGACATTAG 


GAAATTTAGG 


4169 


ATAAATAGTT GACATATATT 


TTATATCCTC TTCTGTTGAA 


TGCAGTCCAA 


ACATGAAAGG 


4229 


AAATAATTGT TTTATATTAT 


AACTCTGAAG CATGATAAAG 


GGGCAGTTCA 


CAATTTTCAC 


4289 


CATTTAAACA CAAATTTGCT 


GCACAGAATA TCACCATTGC 


AGTTCAAAAC 


AAAACAAAAC 


4349 


AAAAAGTCTT TTGTTTGTGA 


ACACTGATGC AAGAAACTTG 


TTAAATGAAA 


GGACTCTTTA 


4409 


CCCTAGAAGG AAGAGGTGAA 


GGATCTGGCT TGTTTTTAAA 


GCTTTATTTA 


TTAAACCATA 


4469 


TTATTTGATT ACTGTGTTAG 


AATTTCATAA GCAATAATTA 


AATGTGTCTT 


TATGGAATTC 


4529 
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